This Page Is Inserted by IFW Operations 
and is not a part of the Official Record 



BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of 
the original documents submitted by the applicant. 

Defects in the images may include (but are not limited to): 

• BLACK BORDERS 

• TEXT CUT OFF AT TOP, BOTTOM OR SIDES 

• FADED TEXT 

• ILLEGIBLE TEXT 

• SKEWED/SLANTED IMAGES 

• COLORED PHOTOS 

• BLACK OR VERY BLACK AND WHITE DARK PHOTOS 

• GRAY SCALE DOCUMENTS 

IMAGES ARE BEST AVAILABLE COPY. 



As rescanning documents will not correct images, 
please do not report the images to the 
Image Problem Mailbox. 



^ 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 ■ 
C12N 9/64 



Al 



(11) International Publication Number: WO 98/40475 

(43) International Publication Date: 17 September 1998 (17.09.98) 



(21) International Application Number: PCT/US98/04694 

(22) International Filing Date: 1 1 March 1998 (1 1 .03.98) 



(30) Priority Data: 
08/814,394 



11 March 1997(11.03.97) 



US 



(71) Applicant: ABBOTT LABORATORIES [US/US]; CHAD 

0377/AP6D-2, 100 Abbott Park Road, Abbott Park, IL 
60064-3500 (US). 

(72) Inventors: FALDUTO, Michael; 1124 Greenbay Road, Glen- 

coe, IL 60022 (US). MAGNUSON, Scott, R.; 2360 Oak 
Tree Lane, Park Ridge, IL 60068 (US). MORGAN, Dou- 
glas, W.; 1009 Havenwood Drive, Libertyville, IL 60048 
(US). 

(74) Agents: CASUTO, Dianne et al.; Abbott Laboratories, Chad 
0377/AP6D-2, 100 Abbott Park Road, Abbott Park, IL 
60064-3500 (US). 



(81) Designated States: CA, JP, MX, European patent (AT, BE, 
CH, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, MC, NL, 
PT, SE). 



Published 

With international search report. 

Before the expiration of the time limit for amending the 
claims and to be republished in the event of the receipt of 
amendments. 



(54) Title: HUMAN MATRIX METALLOPROTEASE GENE, PROTEINS ENCODED THEREFROM AND METHODS OF USING 
SAME 



(57) Abstract 



The present invention provides novel complementary DNA (cDNA) sequences encoding human matrix metalloprotease 19 proteins. 
The present invention also provides recombinant DNA molecules encoding matrix metalloprotease 19 polypeptides and processes for 
producing the novel proteins. The cDNA is cloned into expression vectors for expression in recombinant hosts. The cDNA is useful 
to produce recombinant full length MMP19s or fragments thereof. The cDNA and the recombinant proteins derived therefrom and/or 
antibodies to the proteins are useful in diagnostic assays and for the development of therapeutic agents that affect MMP function. 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCX 



AL 


Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


Armenia 


FI 


Finland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


FR 


France 


LU 


Luxembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


sz 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Burkina Faso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


UZ 


Uzbekistan 


CF 


Central African Republic 


JP 


Japan 


NE 


Niger 


VN 


Viet Nam 


CG 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


ZW 


Zimbabwe 


CI 


Cote d'lvoire 


KP 


Democratic People's 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Republic of Korea 


PT 


Portugal 






CU 


Cuba 


KZ 


Kazakstan 


RO 


Romania 






cz 


Czech Republic 


LC 


Saint Lucia 


RU 


Russian Federation 






DE 


Germany 


LI 


Liechtenstein 


SD 


Sudan 






DK 


Denmark 


LK 


Sri Lanka 


SE 


Sweden 






EE 


Estonia 


LR 


Liberia 


SG 


Singapore 







WO 98/40475 



PCTYUS98/04694 



HUMAN MATRIX METALLOPROTEASE GENE, PROTEINS ENCODED THEREFROM 

AND METHODS OF USING SAME 

Background of the Invention 

5 

1. Field of the Invention 

The present invention relates generally to therapeutic and diagnostic agents in cancer 
and inflammatory diseases. More specifically, the present invention relates to polynucleotide 
sequences encoding matrix metalloprotease proteins, as well as methods which utilize these 
1 0 sequences, which are useful for the detection, diagnosis, staging, monitoring, prognosis, 
prevention, or treatment of cancer or inflammatory diseases. 

2. Description of the Related Art 

The matrix metalloproteases (MMPs) are a multi-enzyme family capable of completely 

1 5 degrading the components of the extracellular matrix (ECM), their natural substrates (W.B. 

Ennis, and L. M. Matrisian, 7. Neuro-Oncology, 18: 105-109 (1994)). The ECM is a 
meshwork of cells and various types of collagens and proteoglycans, collectively called 
connective tissue, which provides mechanical support and helps to maintain the structural 
integrity of tissues and organs. The function of the ECM is particularly apparent in articular 

2 0 cartilage where it provides cushioning and ease of movement between bones in joints. The 

MMPs are secreted by the cellular components of the ECM (fibroblasts, chondrocytes and 
synoviocytes) and inflammatory cells (neutrophils and macrophages) in inactive forms 
(zymogens) which are converted extracellularly to the active enzymes by various proteinases. 
Normally MMPs function in a highly regulated fashion as part of the physiological turnover of 

2 5 the ECM, effectively renewing and remodeling the ECM. However, in the clinical features of 

several diseases, the ECM is degraded and there is much evidence to support that MMPs play a 
significant pathological role in ECM degradation. 

At least fourteen members of the MMP family have been identified and most assigned 
EC numbers; several more have been discovered recently. They can be classified generally 

3 0 according to four subgroups based on substrate preference or cellular localization; e.g., 

collagenases prefer Type I and II collagen, gelatinases prefer Type IV collagen, and 
stromelysins prefer proteoglycans (L. M. Matrisian, Bio. Essays 14: 455-463 (1992)). The 
fourth subgroup are the membrane type MMPs (mtMMPs) which are characterized by the 
presence of a hydrophobic transmembrane domain near the C-terminus for anchoring the 
3 5 protein in the cell membrane. All of the other MMPs are secreted into the extracellular milieu. 

Most of the known MMPs contain zinc in their catalytic sites and require calcium for 
activity. The major human MMPs have been cloned and exhibit greater than 50% homology. 
They contain a leader sequence for signaling their secretion by cells; a highly conserved pro- 
enzyme sequence removed upon activation; a catalytic site with a highly conserved zinc 
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binding domain; and a carboxy terminal region containing a conserved sequence similar to 
hemopexin, a heme binding protein. Although MMPs can be readily activated in vitro using 
mercurial compounds or trypsin, the precise mechanism for propeptide removal and activation 
of MMPs in vivo is not understood. Some MMPs can undergo an autoactivation process, 
5 while recent evidence indicates that membrane type MMPs may function as activators of other 
MMPs. 

A growing line of evidence implicates the MMPs as important enzymes in cancer 
metastasis. Although different cancer cell lines have been shown to express various MMPs 
when grown in culture, gelatinase A in particular has been the focus of a number of recent 
1 0 studies which demonstrates its role in the invasiveness of cancer cells (W. G. Stetler- 

Stevenson, et al, FASEB J. 7: 1434-1441 (1993)). For example, gelatinase A is found in the 
urine of bladder cancer patients and specific monoclonal antibodies have been used to detect 
the enzyme in breast tumor sections (I.M. Margulies, et aL 9 l: 467-474 (1992)). 
The enzyme is expressed in an invasive prostate cancer cell line (PC-3 ML) and cells 

1 5 transfected with the gelatinase A gene are capable of extravasation when injected into mice (M. 

E. Stearns, and M. Wang, Oncology Res. 6: 195-201 (1994)). These studies and others 
implicate the involvement of gelatinase A in tumor metastasis, and suggest that inhibitors of 
this enzyme may offer therapeutic potential in certain forms of cancer. A broad spectrum 
matrix metalloproteinase inhibitor has been shown to decrease the tumor burden of mice 

2 0 bearing ovarian carcinoma xenographs (B. Davies, et al, Cancer Res. 53: 2087-2091 

(1993)). This compound (BB-94, batimastat) is currently being evaluated in clinical trials for 
malignant ascites (S.A. Watson, etal, Cancer Research, 55: 3629-3633 (1995)); however, its 
poor bioavailability necessitates parental administration. Gelatinase A selective succinyl 
hydroxamates have been suggested as anti-cancer agents as well, yet these compounds possess 

2 5 the same peptidic backbone as batimastat (Porter, J. R.; Beeley, N. R. A.; Boyce, B. A.; 

Mason, B.; Millican, A.; Millar, K.; Leonard, J.; Morphy, J. R.; O'Connell, J. P. Potent and 
selective inhibitors of gelatinase A 1. Hydroxamic acid derivatives. Bioorg. Med Chem. Lett., 
4: 2741-2746 (1994)). More recently, an orally active, broad spectrum, MMP inhibitor (BB- 
2516, marimastat) was reported to stop progression of colorectal, ovarian, prostatic and 

3 0 pancreatic cancer (R. P. Beckett, et ai, D. D. T., 1: 16-26 (1996)). 

Several lines of evidence indicate that the unregulated activity of MMPs is responsible 
for the joint degradation observed in rheumatoid arthritis and osteoarthritis. In these human 
arthritides, activated forms of the MMPs and their products (glycosaminoglycans and collagen 
fragments) are found in synovial fluids and joint tissues in abnormally high amounts (see E.D. 
3 5 Harris Jr., Role of collagenase in joint destruction. The Joints and Synovial Fluid, Vol., 1 , 
Sokologg, L., Ed., Orlando, FL, Academic Press, 1977, T.E. Cawston, et ai, Arthritis 
Rheum., 27: 285-290 (1984), Cawston, T. Ann. Rheumatic Diseases., 52: 769-770 (1993), 
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Y.L. Okada, et al, ^ Biol. Chem., 261: 14245-14255 (1986), D.L. Scott, et al, Molec. 
Aspects Med., 12: 341-394 (1991), and E.D. Harris, et al, Arthritis Rheum., 12: 92-102 
(1992)). These arthritic tissues also show a greater-than-normal expression of MMPs (S.S. 
McCachren, Arthritis and Rheumatism , 34: 1085-1093 (1991), A.J.P. Docherty and G. 
5 Murphy, Ann. Rheumat. Dis., 49: 469-479 (1990)) which is induced by cytokines and 

growth factors, also found abundantly in these tissues (D.L. Scott, et al, (1991), supra, S.M. 
Frisch and H.E. Ruley, J. Biol Chem., 262: 16300-16304 (1987)). In addition, the 
activities of MMPs in normal tissues are thought to be regulated by the presence of 
endogenous tissue inhibitors of MMPs, (TIMPs, see T.E. Cawston, Curr.Med. Lit. Rheum., 
10 3:1 27-0 ( 1 984)). The ratio of the amounts of TIMP and MMPs is thought to maintain a 

balance between the rates of degradation and synthesis of ECM. In tissues from rheumatoid 
arthritics, an abnormally high expression of MMPs results in an imbalance of these enzymes 
and degradation of ECM (S.S. McCachren (1991) and T.E. Cawston (1984), supra). Thus in 
arthritis, inhibition of the exacerbated degradative activities of MMPs by specific agents could 

1 5 help restore this balance. In rodent models which mimic the biochemical features of arthritis, 

there is evidence that the combined action of proteoglycan loss (due to stromelysin activity) 
and cartilage degradation (due to collagenase) are early events in this disease (R.M. Hembry, 
et al, Am. J. Pathol, 143: 628-642 (1993), K.A. Hasty, et al, Arthritis and Rheumatism , 
33: 388-397 (1990)). Several prototype inhibitors of MMPs have been shown to reduce 

2 0 cartilage degradation in these animal models (M.J. DiMartino, et al, J. Cell Biochem. suppl., 

19E: 179 (1991), P. Brown, etal, Orally active inhibitors of cartilage degradation. Abst. # 
81, Abstracts of Inflammation '93, Vienna, Austria, p. 29. (1993)). 

There is much evidence to suggest that MMPs mediate the migration of inflammatory 
cells into endothelium (D., Moscatelli and D.B. Rifkin, Biochim. Biophys. Acta., 948: 67-85 

2 5 (1988), P. Zaoui, et al, Matrix metalloproteases (MMP) exocytosis from neutrophils is 

inhibited by endothelial adhesion. Abst. # 83, Abstracts of Inflammation '93, Vienna, Austria, 
p. 29 (1993)) participating in periodontal diseases (H. Birkedal-Hansen, J. Peridontol, 64: 
474-484 (1993)) and facilitating the growth of atherosclerotic plaques (A. M. Henny, et al, 
Proc. Natl Acad. Sci., 8: 8154-8158 (1991)). Recently, gelatinase-A was reported to 

3 0 promote cleavage of the amyloid protein precursor which would suggest a role in Alzheimers 

disease for this MMP (N. Peress, et al, J. Neuropathol Exp. Neurol, 54: 16-22 (1995), 
R.N. Lipage, etal, FEBS Lett., 377: 267-270 (1995)). Thus, there is compelling evidence 
that MMPs play an important role in arthritis and other inflammatory diseases and that targeted 
inhibition of these proteinases by pharmaceutical agents could have beneficial effects. 
3 5 It would be advantageous to provide specific methods and reagents for the diagnosis, 

staging, prognosis, monitoring, prevention or treatment of diseases and conditions associated 
with imbalances in the production or activity of MMPs or to indicate possible predisposition to 
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these conditions. Such methods would include assaying a test sample for products of the 
gene. Such methods would comprise making cDNA from mRNA in the test sample, 
amplifying (when necessary) portions of the cDNA corresponding to the gene or a fragment 
thereof, and detecting the cDNA product as an indication of the presence of the cancer; or 
5 detecting translation products of the mRNAs comprising gene sequences as an indication of the 
presence of the disease. These reagents include polynucleotide(s), or fragments) thereof 
which may be used in diagnostic methods such as reverse transcriptase-polymerase chain 
reaction (RT-PCR), PCR, or hybridization assays of biopsied tissue; or proteins which are the 
translation products of such mRNAs; or antibodies directed against these proteins. Such 
1 0 assays would include methods for assaying a sample for product(s) of the gene and detecting 
the product(s) as an indication of disease. Drug treatment or gene therapy for conditions or 
diseases associated with these detected diseases and conditions then can be based on these 
identified gene sequences or their expressed proteins, and efficacy of any particular therapy 
can be monitored using the diagnostic methods disclosed herein. 

1 5 Studies to understand the role of MMPs in tumor growth, metastases, or inflammatory 

conditions and diseases such as arthritis and their potential as a therapeutic or diagnostic tools 
are limited to previously described MMP proteins. It would be advantageous to identify novel 
human MMPs which may be directly involved in the physiology of, for example, tumor 
growth, extravasation, or invasion. Isolation of DNA sequences encoding human MMPs 

2 0 would permit more extensive studies on the association and regulation of individual MMPs in 

specific cancers or tumor types, and in diseases such as arthritis or other inflammatory 
conditions. In addition, the identification of tissue-specific or disease-specific MMPs would 
provide more direct targets for therapeutics designed to attenuate these diseases. 

2 5 Summary of the Invention 

This present invention provides an isolated and purified polynucleotide encoding a 
matrix metal loprotease (provisionally named MMP 19), polynucleotide fragments thereof, 
expression vectors containing those polynucleotides, host cells transformed with those 

3 0 expression vectors, processes for making the MMP protein using those polynucleotides and 

vectors, isolated and purified MMP protein and polypeptide fragments thereof, and antibodies 
raised to synthetic peptides derived from the MMP protein. The invention also provides 
diagnostic assays to identify the presence of the MMP polynucleotide or polypeptide, assays 
used to identify agents that affect the function of the MMP polynucleotide or polypeptide, and 
3 5 the use as therapeutic agents of the MMP polynucleotide, polypeptides, or antibodies. 

The cDNA clone was obtained by screening a human cDNA expression database with a 
consensus sequence to twelve other human MMPs. The sequence of the partial cDNA isolated 
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5 

indicated that the gene product is a novel MMP protein that is expressed in a limited number of 
tissues. 

Brief Description of the Drawings 

5 

FIG. 1 shows the MMP consensus sequence used to query the LifeSeq™ database. 
The functional motifs present in all MMPs are double underlined. 

FIG. 2 shows the complete nucleotide and translated sequence (frame 3) of EST 

1 0 #907334 in LifeSeq™. A putative cysteine switch motif (PRCGVTD which is SEQ ID NO:8) 

and a putative furin recognition site (RKKR which is SEQ ID NO:9) are double underlined. 

FIG. 3 shows the length and position of individual sequencing reactions to generate the 
double stranded nucleotide sequence of the insert in clone #907334. The boxes at the bottom 
15 are graphic representations of the three forward reading frames with start codons shown as 
short upward lines and stop codons as longer downward lines. Frame 1 contains the longest 
open reading frame. 

FIG. 4 shows the doubled stranded complete nucleotide sequence (SEQ ID NO: 1 and 

2 0 SEQ ID NO:2, top and bottom strand respectively) of the MMP19 gene and the translated 

amino acid sequence (SEQ ID NO:9) of MMP 19 (shown beneath in single letter codes). 
Indicated in bold type are the structural domains found in other MMPs. The cysteine switch, 
furin recognition site, and zinc binding site are double underlined. The putative poly 
adenylation signal in the 3* untranslated region of the mRNA is single underlined. Boxes 

2 5 indicate the position of the 3' end of the primers used in the synthesis of DNA constructs for 

the expression of the catalytic portion of MMP 19. 

FIG. 5 shows the alignment of the polypeptide encoded by SEQ ID NO: 10 with the 
amino acid sequences of all known human MMPs, namely, matrilysin (GenBank Accession 

3 0 No. L22524, SEQ ID NO: 1 8), Gelatinase A (GenBank Accession No. M55593, SEQ ID 

NO: 19), Gelatinase B (GenBank Accession No. J05070, SEQ ID NO:20), Stromelysin 1 
(GenBank Accession No. J03209, SEQ ID NO:21), Stromelysin 2 (GenBank Accession No. 
X07820, SEQ ID NO:22), Stromelysin 3, (GenBank Accession No. X57766, SEQ ID 
NO:23), Collagenase 1 (GenBank Accession No. Ml 3509, SEQ ID NO:24), Collagenase 2 
3 5 (GenBank Accession No. J05556, SEQ ID NO:25), Collagenase 3 (GenBank Accession No. 
X75308, SEQ ID NO:26), MMP 12 (GenBank Accession No. L23808, SEQ ID NO:27), 
MMP 18 (GenBank Accession No. Y08622 or X92521, SEQ ID NO:28), mtlMMP (GenBank 
Accession No. D26512, SEQ ID NO:29), mt2MMP (GenBank Accession No. Z48482, SEQ 
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6 

ID NO:30), mt3MMP (GenBank Accession No. D50477, SEQ ID NO:31), and mt4MMP 
(GenBank Accession No. X89576, SEQ ID NO:32). The consensus sequence was generated 
using a plurality of 5. Characteristic motifs found in MMPs are double underlined. 

5 FIG. 6 shows the alignment and tissue distibution of ESTs found in LifeSeq™ which 

are 100% identical to regions of MMP19 cDNA. 

FIG. 7 shows a Phospholmage of a Northern blot of human RNA (2p,g of poly A + 
RNA/lane) probed with a 32 P-labeled MMP19 cDNA probe. 

10 

Detailed Description of the Invention 

The present invention provides isolated and purified polynucleotides that encode a 
human matrix metalloprotease, fragments thereof, expression vectors containing those 

1 5 polynucleotides, host cells transformed with those expression vectors, a process of making the 

MMP using those polynucleotides and vectors, and isolated and purified recombinant MMP 
and polypeptide fragments thereof. 

The present invention also provides for the use of MMP 19 in developing treatments for 
any disorder mediated (directly or indirectly) by insufficient amounts or production of MMP 19 

2 0 protein. Purified human MMP19 protein may be administered to a patient with such a 

condition. Alternatively, gene therapy techniques for producing MMP19 polypeptide in vivo 
are also provided. 

The present invention also provides methods for assaying a test sample for products of 
the MMP gene, which comprises making cDNA from mRNA in the test sample, and detecting 

2 5 the cDNA as an indication of the presence of the MMP gene. The method may include an 

amplification step, wherein portions of the cDNA corresponding to the gene or fragment 
thereof is amplified. Methods also are provided for assaying for the translation products of 
mRNAs. Test samples which may be assayed by the methods provided herein include tissues, 
cells, body fluids and secretions. The present invention also provides reagents such as 

3 0 oligonucleotide primers and polypeptides which are useful in performing these methods. 

Portions of the nucleic acid sequences disclosed herein are useful as primers for the 
reverse transcription of RNA or for the amplification of cDNA; or as probes to determine the 
presence of certain cDNA sequences in test samples. Also disclosed are nucleic acid 
sequences which permit the production of encoded polypeptide sequences which are useful as 
3 5 standards or reagents in diagnostic immunoassays, targets for pharmaceutical screening assays 
and/or as components or target sites for various therapies. Monoclonal and polyclonal 
antibodies directed against at least one epitope contained within these polypeptide sequences 
are useful for diagnostic tests as well as delivery agents for therapeutic agents and for 
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screening for diseases or conditions associated with cancers, arthritis, or inflammation. 
Isolation of sequences of other portions of the gene of interest can be accomplished by utilizing 
probes or PCR primers derived from these nucleic acid sequences, thus allowing additional 
probes and polypeptides of the genome of interest to be established, which also will be useful 
5 in the diagnosis, prognosis and/or treatment of diseases and conditions characterized by the 
MMP gene disclosed herein. 

The techniques for determining the amino acid sequence "similarity" are well-known in 
the art. In general, "similarity" means the exact amino acid to amino acid comparison of two 
or more polypeptides at the appropriate place, where amino acids are identical or possess 
1 0 similar chemical and/or physical properties such as charge or hydrophobicity. A so-termed 

"percent similarity" then can be determined between the compared polypeptide sequences. The 
techniques for determining nucleic acid and amino acid sequence identity also are well known 
in the art and include determining the nucleotide sequence of the mRNA for that gene (usually 
via a cDNA intermediate) and determining the amino acid sequence encoded therein, and 

1 5 comparing this to a second amino acid sequence. In general, "identity" refers to an exact 

nucleotide to nucleotide or amino acid to amino acid correspondence of two polynucleotides or 
polypeptide sequences, respectively. Two or more polynucleotide sequences can be compared 
by determining their "percent identity." Two amino acid sequences likewise can be compared 
by determining their "percent identity." The programs available in the Wisconsin Sequence 

2 0 Analysis Package (available from Genetics Computer Group, Madison, WI), for example, the 

GAP program, are capable of calculating both the identity between two polynucleotides and the 
identity and similarity between two polypeptide sequences, respectively. Other programs for 
calculating identity or similarity between sequences are known in the art. 

The compositions and methods described herein will enable the identification of certain 

2 5 markers as indicative of cancer, arthritis, or inflammation resulting from or associated with an 

MMP19 disorder; the information obtained therefrom will aid in the diagnosis, staging, 
monitoring, prognosis and/or therapy of those same diseases or conditions. Test methods 
include, for example, probe assays which utilize the sequence(s) provided herein and which 
also may utilize nucleic acid amplification methods such as the polymerase chain reaction 

3 0 (PCR), the ligase chain reaction (LCR); and hybridization. In addition, the nucleotide 

sequences provided herein contain open reading frames from which one or more immunogenic 
epitopes may be found. Such epitopes are believed to be unique to the disease state or 
condition associated with MMP 19 related cancer, arthritis, or inflammation. The uniqueness 
of any epitope may be determined by its immunological reactivity with the specific MMP gene 
3 5 in diseased tissues and lack of immunological reactivity with non-diseased tissues. Methods 

for determining immunological reactivity are well-known and include but are not limited to, for 
example, radioimmunoassay (RIA), enzyme-linked immunosorbent assay (ELISA), 
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hemagglutination (HA), fluorescence polarization immunoassay (FPIA); chemiluminescent 
immunoassay (CLIA), and others; several examples of suitable methods are described herein. 
Unless otherwise stated, the following terms shall have the following meanings: 
A polynucleotide "derived from" a designated sequence refers to a polynucleotide 
5 sequence which is comprised of a sequence of approximately at least about 6 nucleotides, is 

preferably at least about 8 nucleotides, is more preferably at least about 10-12 nucleotides, and 
even more preferably is at least about 15-20 nucleotides corresponding, i.e., identical to or 
complementary to, a region of the designated nucleotide sequence. The sequence may be 
complementary to or identical to a sequence which is unique to a particular polynucleotide 
1 0 sequence as determined by techniques known in the art. Comparisions to sequences in 

databanks, for example, can be used as a method to determine the uniqueness of a designated 
sequence. Regions from which sequences may be derived include but are not limited to 
regions encoding specific epitopes, as well as non-translated and/or non-transcribed regions. 
The derived polynucleotide will not necessarily be derived physically from the 

1 5 nucleotide sequence of interest under study, but may be generated in any manner, including 

but not limited to chemical synthesis, replication, reverse transcription or transcription, which 
is based on the information provided by the sequence of bases in the region(s) from which the 
polynucleotide is derived; as such, it may represent either a sense or an antisense orientation of 
the original polynucleotide. In addition, combinations of regions corresponding to that of the 

2 0 designated sequence may be modified in ways known in the art to be consistent with an 

intended use. 

The term "primer" denotes a specific oligonucleotide sequence complementary to a 
target nucleotide sequence and used to hybridize to the target nucleotide sequence and serve as 
an initiation point for nucleotide polymerization catalyzed by either DNA polymerase, RNA 

2 5 polymerase or reverse transcriptase. 

The term "probe" denotes a defined nucleic acid segment (or nucleotide analog 
segment, i.e., PNA) which can be used to identify specific DNA or RNA present in samples 
bearing the complementary sequence. 

A "polypeptide" or "amino acid" sequence derived from a designated nucleic acid 

3 0 sequence refers to a polypeptide having an amino acid sequence identical to that of a 

polypeptide encoded in the sequence or a portion thereof wherein the portion consists of at 
least 3 to 5 contiguous amino acids, and more preferably at least 8 to 10 contiguous amino 
acids, and even more preferably 15 to 20 contiguous amino acids, or which is 
immunologically identifiable with a polypeptide encoded in the sequence. 
3 5 A "recombinant polypeptide" as used herein means at least a polypeptide which by 

virtue of its origin or manipulation is not associated with all or a portion of the polypeptide 
with which it is associated in nature and/or is linked to a polypeptide other than that to which it 
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is linked in nature. A recombinant or derived polypeptide is not necessarily translated from a 
designated nucleic acid sequence. It also may be generated in any manner, including chemical 
synthesis or expression of a recombinant expression system. 

The term "synthetic peptide" as used herein means a polymeric form of amino acids of 
5 any length, which may be chemically synthesized by methods well-known to the routineer. 
These synthetic peptides are useful in various applications. 

The term "polynucleotide" as used herein means a polymeric form of nucleotides of 
any length, either ribonucleotides or deoxyribonucleotides. This term refers only to the 
primary structure of the molecule. Thus, the term includes double- and single-stranded DNA, 
10 as well as, double- and single-stranded RNA. It also includes modifications, such as 
methylation or capping, and unmodified forms of the polynucleotide. 

"A sequence corresponding to a cDNA" means that the sequence contains a 
polynucleotide sequence that is identical to or complementary to a sequence in the designated 
DNA. The degree (or "percent") of identity or complementarity to the cDN A will be 

1 5 approximately 50% or greater, will preferably be at least about 70% or greater, and more 

preferably will be at least about 90%. The sequence that corresponds will be at least about 50 
nucleotides in length, will preferably be about 60 nucleotides in length, and more preferably, 
will be at least about 70 nucleotides in length. The correspondence between the gene or gene 
fragment of interest and the cDNA can be determined by methods known in the art, and 

2 0 include, for example, a direct comparison of the sequenced material with the cDNAs 

described, or hybridization and digestion with single strand nucleases, followed by size 
determination of the digested fragments. 

"Purified polynucleotide" refers to a polynucleotide of interest or fragment thereof 
which is essentially free, i.e., contains less than about 50%, preferably less than about 70%, 

2 5 and more preferably, less than about 90% of the protein with which the polynucleotide is 

naturally associated. Techniques for purifying polynucleotides of interest are well-known in 
the art and include, for example, disruption of the cell containing the polynucleotide with a 
chaotropic agent and separation of the polynucleotide(s) and proteins by ion-exchange 
chromatography, affinity chromatography and sedimentation according to density. Thus, 

3 0 "purified polypeptide" means a polypeptide of interest or fragment thereof which is essentially 

free, that is, contains less than about 50%, preferably less than about 70%, and more 
preferably, less than about 90% of cellular components with which the polypeptide of interest 
is naturally associated. Methods for purifying are known in the art. 

The term "isolated" means that the material is removed from its original environment 
3 5 (e.g., the natural environment if it is naturally occurring). For example, a naturally-occurring 
polynucleotide or polypeptide present in a living animal is not isolated, but the same 
polynucleotide or DNA or polypeptide, which is separated from some or all of the coexisting 
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materials in the natural system, is isolated. Such polynucleotide could be part of a vector 
and/or such polynucleotide or polypeptide could be part of a composition, and still be isolated 
in that the vector or composition is not part of its natural environment. 

"Polypeptide" as used herein indicates a molecular chain of amino acids and does not 
5 refer to a specific length of the product. Thus, peptides, oligopeptides and proteins are 

included within the definition of polypeptide. This term, however, is not intended to refer to 
post-expression modifications of the polypeptide, for example, glycosylates, acetylations, 
phosphorylations and the like. 

"Recombinant host cells," "host cells," "cells," "cell lines," "cell cultures," and other 
1 0 such terms denoting microorganisms or higher eukaryotic cell lines cultured as unicellular 

entities refer to cells which can be, or have been, used as recipients for recombinant vector or 
other transferred DNA, and include the original progeny of the original cell which has been 
transfected. 

As used herein "replicon" means any genetic element, such as a plasmid, a 

1 5 chromosome or a virus, that behaves as an autonomous unit of polynucleotide replication 

within a cell. 

A "vector" is a replicon in which another polynucleotide segment is attached, such as to 
bring about the replication and/or expression of the attached segment. 

The term "control sequence" refers to polynucleotide sequences which are necessary to 

2 0 effect the expression of coding sequences to which they are ligated. The nature of such control 

sequences differs depending upon the host organism. In prokaryotes, such control sequences 
generally include promoter, ribosomal binding site and terminators; in eukaryotes, such control 
sequences generally include promoters, terminators and, in some instances, enhancers. The 
term "control sequence" thus is intended to include at a minimum all components whose 

2 5 presence is necessary for expression, and also may include additional components whose 

presence is advantageous, for example, leader sequences. 

"Operably linked" refers to a situation wherein the components described are in a 
relationship permitting them to function in their intended manner. Thus, for example, a control 
sequence "operably linked" to a coding sequence is ligated in such a manner that expression of 

3 0 the coding sequence is achieved under conditions compatible with the control sequences. 

The term "open reading frame" or "ORF" refers to a region of a polynucleotide 
sequence which encodes a polypeptide; this region may represent a portion of a coding 
sequence or a total coding sequence. 

A "coding sequence" is a polynucleotide sequence which is transcribed into mRNA and 
3 5 translated into a polypeptide when placed under the control of appropriate regulatory 

sequences. The boundaries of the coding sequence are determined by a translation start codon 
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at the 5' -terminus and a translation stop codon at the 3' -terminus. A coding sequence can 
include, but is not limited to, mRNA, cDNA, and recombinant polynucleotide sequences. 

The term "immunologically identifiable with/as" refers to the presence of epitope(s) and 
polypeptide(s) which also are present in and are unique to the designated polypeptide(s). 
5 Immunological identity may be determined by antibody binding and/or competition in binding. 
These techniques are known to the routineer and also are described herein. The uniqueness of 
an epitope also can be determined by computer searches of known data banks, such as 
GenBank, for the polynucleotide sequences which encode the epitope, and by amino acid 
sequence comparisons with other known proteins. 
10 As used herein, "epitope" means an antigenic determinant of a polypeptide. 

Conceivably, an epitope can comprise three amino acids in a spatial conformation which is 
unique to the epitope. Generally, an epitope consists of at least five such amino acids, and 
more usually, it consists of at least eight to ten amino acids. Methods of examining spatial 
conformation are known in the art and include, for example, x-ray crystallography and two- 

1 5 dimensional nuclear magnetic resonance. 

A "conformational epitope" is an epitope that is comprised of specific juxtaposition of 
amino acids in an immunologically recognizable structure, such amino acids being present on 
the same polypeptide in a contiguous or non-contiguous order or present on different 
polypeptides. 

2 0 A polypeptide is "immunologically reactive" with an antibody when it binds to an 

antibody due to antibody recognition of a specific epitope contained within the polypeptide. 
Immunological reactivity may be determined by antibody binding, more particularly by the 
kinetics of antibody binding, and/or by competition in binding using as competitor(s) a known 
polypeptide(s) containing an epitope against which the antibody is directed. The methods for 

2 5 determining whether a polypeptide is immunologically reactive with an antibody are known in 

the art. 

As used herein, the term "immunogenic polypeptide containing an epitope of interest" 
means naturally occurring polypeptides of interest or fragments thereof, as well as 
polypeptides prepared by other means, for example, by chemical synthesis or the expression 

3 0 of the polypeptide in a recombinant organism. 

The term "transformation" refers to the insertion of an exogenous polynucleotide into a 
host cell, irrespective of the method used for the insertion. For example, direct uptake, 
transduction or f-mating are included. The exogenous polynucleotide may be maintained as a 
non-integrated vector, for example, a plasmid, or alternatively, may be integrated into the host 
3 5 genome. 

"Treatment" refers to prophylaxis and/or therapy. 
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The term "individual" as used herein refers to vertebrates, particularly members of the 
mammalian species and includes but is not limited to domestic animals, sports animals, 
primates and humans; more particularly the term refers to humans. 

The term "sense strand" or "plus strand" (or "+") as used herein denotes a nucleic acid 
5 that contains the sequence that encodes the polypeptide. The term "antisense strand" or "minus 
strand" (or "-") denotes a nucleic acid that contains a sequence that is complementary to that of 
the "plus" strand. 

The term "test sample" refers to a component of an individual's body which is the 
source of the analyte (such as, antibodies of interest or antigens of interest). These 
1 0 components are well known in the art. These test samples include biological samples which 
can be tested by the methods of the present invention described herein and include human and 
animal body fluids such as whole blood, serum, plasma, cerebrospinal fluid, urine, lymph 
fluids, and various external secretions of the respiratory, intestinal and genitorurinary tracts, 
tears, saliva, milk, white blood cells, myelomas and the like; biological fluids such as cell 

1 5 culture supernatants; fixed tissue specimens; and fixed cell specimens. 

"Purified product" refers to a preparation of the product which has been isolated from 
the cellular constituents with which the product is normally associated, and from other types of 
cells which may be present in the sample of interest. 

"PNA" denotes a "peptide nucleic acid analog" which may be utilized in a procedure 

2 0 such as an assay described herein to determine the presence of a target. "MA" denotes a 

"morpholino analog" which may be utilized in a procedure such as an assay described herein to 
determine the presence of a target. See, for example, U.S. Patent No. 5,378,841 , which is 
incorporated herein by reference. PNAs are neutrally charged moieties which can be directed 
against RNA targets or DNA. PNA probes used in assays in place of, for example, the DNA 

2 5 probes of the present invention, offer advantages not achievable when DNA probes are used. 

These advantages include manufacturability, large scale labeling, reproducibility, stability, 
insensitivity to changes in ionic strength and resistance to enzymatic degradation which is 
present in methods utilizing DNA or RNA. These PNAs can be labeled with such signal 
generating compounds as fluorescein, radionucleotides, chemiluminescent compounds, and 

3 0 the like. PNAs or other nucleic acid analogs such as MAs thus can be used in assay methods 

in place of DNA or RNA. Although assays are described herein utilizing DNA probes, it is 
within the scope of the routineer that PNAs or MAs can be substituted for RNA or DNA with 
appropriate changes if and as needed in assay reagents. 

"Analyte," as used herein, is the substance to be detected which may be present in the 
3 5 test sample. The analyte can be any substance for which there exists a naturally occurring 

specific binding member (such as, an antibody), or for which a specific binding member can 
be prepared. Thus, an analyte is a substance that can bind to one or more specific binding 
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members in an assay. "Analyte" also includes any antigenic substances, haptens, antibodies, 
and combinations thereof. As a member of a specific binding pair, the analyte can be detected 
by means of naturally occurring specific binding partners (pairs) such as the use of intrinsic 
factor protein as a member of a specific binding pair for the determination of Vitamin B12, the 
5 use of folate-binding protein to determine folic acid, or the use of a lectin as a member of a 
specific binding pair for the determination of a carbohydrate. The analyte can include a 
protein, a peptide, an amino acid, a nucleotide target, and the like. 

An "Expressed Sequence Tag" or "EST" refers to the partial sequence of a cDNA insert 
which has been made by reverse transcription of mRNA extracted from a tissue, followed by 
1 0 insertion into a vector. 

A "transcript image" refers to a table or list giving the quantitative distribution of ESTs 
in a library and represents the genes active in the tissue from which the library was made. 

The present invention provides assays which utilize specific binding members. A 
"specific binding member," as used herein, is a member of a specific binding pair. That is, 

1 5 two different molecules where one of the molecules through chemical or physical means 

specifically binds to the second molecule. Therefore, in addition to antigen and antibody 
specific binding pairs of common immunoassays, other specific binding pairs can include 
biotin and avidin, carbohydrates and lectins, complementary nucleotide sequences, effector 
and receptor molecules, cofactors and enzymes, enzyme inhibitors and enzymes, and the like. 

2 0 Furthermore, specific binding pairs can include members that are analogs of the original 

specific binding members, for example, an analyte-analog. Immunoreactive specific binding 
members include antigens, antigen fragments, antibodies and antibody fragments, both 
monoclonal and polyclonal, and complexes thereof, including those formed by recombinant 
DNA molecules. 

2 5 The term "hapten," as used herein, refers to a partial antigen or non-protein binding 

member which is capable of binding to an antibody, but which is not capable of eliciting 
antibody formation unless coupled to a carrier protein. 

A "capture reagent," as used herein, refers to an unlabeled specific binding member 
which is specific either for the analyte as in a sandwich assay, for the indicator reagent or 

3 0 analyte as in a competitive assay, or for an ancillary specific binding member, which itself is 

specific for the analyte, as in an indirect assay. The capture reagent can be directly or 
indirectly bound to a solid phase material before the performance of the assay or during the 
performance of the assay, thereby enabling the separation of immobilized complexes from the 
test sample. 

3 5 The "indicator reagent" comprises a "signal-generating compound" ("label") which is 

capable of generating and generates a measurable signal detectable by external means, 
conjugated ("attached") to a specific binding member. "Specific binding member" as used 
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herein means a member of a specific binding pair. That is, two different molecules where one 
of the molecules through chemical or physical means specifically binds to the second molecule. 
In addition to being an antibody member of a specific binding pair, the indicator reagent also 
can be a member of any specific binding pair, including either hapten-anti-hapten systems such 
5 as biotin or anti-biotin, avidin or biotin, a carbohydrate or a lectin, a complementary nucleotide 
sequence, an effector or a receptor molecule, an enzyme cofactor and an enzyme, an enzyme 
inhibitor or an enzyme, and the like. An immunoreactive specific binding member can be an 
antibody, an antigen, or an antibody/antigen complex that is capable of binding either to 
polypeptide of interest as in a sandwich assay, to the capture reagent as in a competitive assay, 
10 or to the ancillary specific binding member as in an indirect assay. When describing probes 

and probe assays, the term "reporter molecule" may be used. A reporter molecule comprises a 
signal generating compound as described hereinabove conjugated to a specific binding member 
of a specific binding pair, such as carbazol or adamantane. 

The various "signal-generating compounds" (labels) contemplated include 

1 5 chromogens, catalysts such as enzymes, luminescent compounds such as fluorescein and 

rhodamine, chemiluminescent compounds such as dioxetanes, acridiniums, phenanthridiniums 
and luminol, radioactive elements, and direct visual labels. Examples of enzymes include 
alkaline phosphatase, horseradish peroxidase, beta-galactosidase, and the like. The selection of 
a particular label is not critical, but it will be capable of producing a signal either by itself or in 

2 0 conjunction with one or more additional substances. 

"Solid phases" ("solid supports") are known to those in the art and include the walls of 
wells of a reaction tray, test tubes, polystyrene beads, magnetic beads, nitrocellulose strips, 
membranes, microparticles such as latex particles, sheep (or other animal) red blood cells, and 
Duracytes® (red blood cells "fixed" by pyruvic aldehyde and formaldehyde, available from 

2 5 Abbott Laboratories, Abbott Park, IL) and others. The "solid phase" is not critical and can be 

selected by one skilled in the art. Thus, latex particles, microparticles, magnetic or non- 
magnetic beads, membranes, plastic tubes, walls of microtiter wells, glass or silicon chips, 
sheep (or other suitable animal's) red blood cells and Duracytes® are all suitable examples. 
Suitable methods for immobilizing peptides on solid phases include ionic, hydrophobic, 

3 0 covalent interactions and the like. A "solid phase", as used herein, refers to any material 

which is insoluble, or can be made insoluble by a subsequent reaction. The solid phase can be 
chosen for its intrinsic ability to attract and immobilize the capture reagent. Alternatively, the 
solid phase can retain an additional receptor which has the ability to attract and immobilize the 
capture reagent. The additional receptor can include a charged substance that is oppositely 
3 5 charged with respect to the capture reagent itself or to a charged substance conjugated to the 
capture reagent. As yet another alternative, the receptor molecule can be any specific binding 
member which is immobilized upon (attached to) the solid phase and which has the ability to 
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immobilize the capture reagent through a specific binding reaction. The receptor molecule 
enables the indirect binding of the capture reagent to a solid phase material before the 
performance of the assay or during the performance of the assay. The solid phase thus can be 
a plastic, derivatized plastic, magnetic or non-magnetic metal, glass or silicon surface of a test 
5 tube, microtiter well, sheet, bead, microparticle, chip, sheep (or other suitable animal's) red 
blood cells, Duracytes® and other configurations known to those of ordinary skill in the art. 

It is contemplated and within the scope of the present invention that the solid phase also 
can comprise any suitable porous material with sufficient porosity to allow access by detection 
antibodies and a suitable surface affinity to bind antigens. Microporous structure generally are 

1 0 preferred, but materials with gel structure in the hydrated state may be used as well. Such 

useful solid supports include but are not limited to nitrocellulose and nylon. It is contemplated 
that such porous solid supports described herein preferably are in the form of sheets of 
thickness from about 0.01 to 0.5 mm, preferably about 0.1mm. The pore size may vary 
within wide limits, and preferably is from about 0.025 to 15 microns, especially from about 
15 0.15 to 15 microns. The surface of such supports may be activated by chemical processes 
which cause covalent linkage of the antigen or antibody to the support. The irreversible 
binding of the antigen or antibody is obtained, however, in general, by adsorption on the 
porous material by poorly understood hydrophobic forces. Other suitable solid supports are 
known in the art. 

2 0 Reagents . 

The present invention provides reagents such as human MMP 19 polynucleotide 
sequences derived from an MMP19 of interest, polypeptides encoded therein, and antibodies 
developed from these polypeptides. The present invention also provides reagents such as 
oligonucleotide fragments derived from the disclosed polynucleotides and nucleic acid 

2 5 sequences complementary to the these polynucleotides. The polynucleotides or polypeptides 

or antibodies of the present invention may be used in the diagnosis, prognosis, and/or 
treatment of individuals with conditions associated with the expression of the MMP 19 gene, 
such as cancer, arthritis, or inflammation, or to identify a predisposition to these conditions. 
The sequences disclosed herein represent unique polynucleotides which can be used in assays 

3 0 or for producing a disease specific profile of gene transcription activity. 

Selected MMP 19 polynucleotides can be used in the methods described herein for the 
detection of normal or altered gene expression. Such methods may employ the MMP 19 
polynucleotides disclosed herein or oligonucleotides, fragments or derivatives thereof, or 
nucleic acid sequences complementary to these polynucleotides. 
3 5 The polynucleotides disclosed herein, their complementary sequences or fragments of 

either can be used in assays to detect, amplify or quantify genes, cDNAs or mRNAs relating to 
cancer, arthritis, or inflammation and associated conditions. They also can be used to identify 
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an entire or partial coding region which encodes for MMP19 polypeptide. They further can be 
provided in individual containers in the form of a kit for assays, or provided as individual 
compositions. If provided in a kit for assays, other suitable reagents such as buffers, 
conjugates and the like may be included. 
5 The polynucleotide(s) may be in the form of mRNA or DNA. Polynucleotides in the 

form of DNA, cDN A, genomic DNA, and synthetic DNA are within the scope of the present 
invention. The DNA may be double-stranded or single-stranded, and if single stranded may 
be the coding (sense) strand or non-coding (anti-sense) strand. The coding sequence which 
encodes the polypeptide may be identical to the coding sequence provided herein or may be a 
1 0 different coding sequence which coding sequence, as a result of the redundancy or degeneracy 
of the genetic code, encodes the same polypeptide as the DNA provided herein. 

This polynucleotide may include only the coding sequence for the polypeptide, or the 
coding sequence for the polypeptide and additional coding sequence such as a leader or 
secretory sequence or a proprotein sequence, or the coding sequence for the polypeptide (and 

1 5 optionally additional coding sequence) and non-coding sequence, such as a non-coding 

sequence 5' and/or 3' of the coding sequence for the polypeptide. 

In addition, the invention includes variant polynucleotides containing modifications 
such as polynucleotide deletions, substitutions or additions; and any polypeptide modification 
resulting from the variant polynucleotide sequence. A polynucleotide of the present invention 

2 0 also may have a coding sequence which is a naturally occurring allelic variant of the coding 

sequence provided herein. 

In addition, the coding sequence for the polypeptide may be fused in the same reading 
frame to a polynucleotide sequence which aids in expression and secretion of a polypeptide 
from a host cell, for example, a leader sequence which functions as a secretory sequence for 

2 5 controlling transport of a polypeptide from the cell. The polypeptide having a leader sequence 

is a preprotein and may have the leader sequence cleaved by the host cell to form the 
polypeptide. The polynucleotides may also encode for a proprotein which is the protein plus 
additional amino acid residues at the N-terminus. A protein having a prosequence is a 
proprotein and may in some cases be an inactive form of the protein. Once the prosequence is 

3 0 cleaved an active protein remains. Thus, the polynucleotide of the present invention may 

encode for a protein, or for a protein having a prosequence or for a protein having both a 
presequence (leader sequence) and a prosequence. 

The polynucleotides of the present invention may also have the coding sequence fused 
in frame to a marker sequence which allows for purification of the polypeptide of the present 
3 5 invention. The marker sequence may be a hexa-histidine tag supplied by a pQE-9 vector to 
provide for purification of the polypeptide fused to the marker in the case of a bacterial host, 
or, for example, the marker sequence may be a hemagglutinin (HA) tag when a mammalian 
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host, e.g. COS-7 cells, is used. The HA tag corresponds to an epitope derived from the 
influenza hemagglutinin protein. See, for example, I. Wilson, et al., Cell 37:767 (1984). 

It is contemplated that polynucleotides will be considered to hybridize to the sequences 
provided herein if there is at least 50%, and preferably at least 70% identity between the 
5 polynucleotide and the sequence. 

Probes constructed according to the polynucleotide sequences of the present invention 
can be used in various assay methods to provide various types of analysis. For example, such 
probes can be used in Fluorescent In Situ Hybridization (FISH) technology to perform 
chromosomal analysis, and used to identify cancer-specific structural alterations in the 
1 0 chromosomes, such as deletions or translocations that are visible from chromosome spreads or 
detectable using PCR-generated and/or allele specific oligonucleotides probes, allele specific 
amplification or by direct sequencing. Probes also can be labeled with radioisotopes, directly- 
or indirectly- detectable haptens, or fluorescent molecules, and utilized for in situ hybridization 
studies to evaluate the mRNA expression of the gene comprising the polynucleotide in fixed 

1 5 tissue specimens or cells. 

The present invention further relates to an MMP19 polypeptide which has the deduced 
amino acid sequence as provided herein, as well as fragments, analogs and derivatives of such 
polypeptide. The polypeptide of the present invention may be a recombinant polypeptide, a 
natural purified polypeptide or a synthetic polypeptide. The fragment, derivative or analog of 

2 0 the MMP19 polypeptide may be one in which one or more of the amino acid residues is 

substituted with a conserved or non-conserved amino acid residue (preferably a conserved 
amino acid residue) and such substituted amino acid residue may or may not be one encoded 
by the genetic code; or it may be one in which one or more of the amino acid residues includes 
a substituent group; or it may be one in which the polypeptide is fused with another 

2 5 compound, such as a compound to increase the half-life of the polypeptide (for example, 

polyethylene glycol); or it may be one in which the additional amino acids are fused to the 
polypeptide, such as a leader or secretory sequence or a sequence which is employed for 
purification of the polypeptide or a proprotein sequence. Such fragments, derivatives and 
analogs are within the scope of the present invention. The polypeptides and polynucleotides of 

3 0 the present invention are preferably provided in an isolated form, and preferably purified. 

Thus, a polypeptide of the present invention may have an amino acid sequence that is 
identical to that of the naturally occurring polypeptide or that is different by minor variations 
due to one or more amino acid substitutions. The variation may be a "conservative change" 
typically in the range of about 1 to 5 amino acids, wherein the substituted amino acid has 
3 5 similar structural or chemical properties, eg, replacement of leucine with isoleucine or 

threonine with serine. In contrast, variations may include nonconservative changes, eg, 
replacement of a glycine with a tryptophan. Similar minor variations may also include amino 
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acid deletions or insertions, or both. Guidance in determining which and how many amino 
acid residues may be substituted, inserted or deleted without changing biological or 
immunological activity may be found using computer programs well known in the art, for 
example, DNASTAR software (DNASTAR Inc., Madison WI). 
5 Preferably, a polypeptide of the present invention is one having at least 40% identity, 

more preferably, at least 50% identity, even more preferably at least 60% identity and even 
more preferably, at least 70% identity to SEQ ID NO: 10. Such a polypeptide may also 
possess the ability to cleave at least the substrate described as SEQ ID NO: 13. 

The present invention also provides an antibody produced by using a purified MMP19 

1 0 polypeptide of which at least a portion of the polypeptide is encoded by MMP19 

polynucleotide selected from the polynucleotides provided herein. These antibodies may be 
used in the methods provided herein for the detection of MMP19 polypeptides in test samples. 
The antibody also may be used for therapeutic purposes, for example, in neutralizing the 
activity of a MMP19 polypeptide in conditions associated with altered or abnormal expression 
15 oftheMMP19gene. 

This invention also provides teachings as to the production of the polynucleotides and 
polypeptides provided herein. 

Probe Assays 

The sequences provided herein may be used to produce probes which can be used in 

2 0 assays for the detection of nucleic acids in test samples. The probes may be designed from 

conserved nucleotide regions of the polynucleotides of interest or from non-conserved 
nucleotide regions of the polynucleotide of interest. The design of such probes for 
optimization in assays is within the skill of the routineer. Generally, nucleic acid probes are 
developed from non-conserved or unique regions when maximum specificity is desired, and 

2 5 nucleic acid probes are developed from conserved regions when assaying for nucleotide 

regions that are closely related to, for example, different members of a multigene family or in 
related species like mouse and man. 

The polymerase chain reaction (PCR) is a technique for amplifying a desired nucleic 
acid sequence (target) contained in a nucleic acid or mixture thereof. In PCR, a pair of primers 

3 0 are employed in excess to hybridize at the outside ends of complementary strands of the target 

nucleic acid. The primers are each extended by a polymerase using the target nucleic acid as a 
template. The extension products become target sequences themselves, following dissociation 
from the original target strand. New primers then are hybridized and extended by a 
polymerase, and the cycle is repeated to geometrically increase the number of target sequence 
3 5 molecules. PCR is disclosed in U.S. patents 4,683,195 and 4,683,202, which are 
incorporated herein by reference. 
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The Ligase Chain Reaction (LCR) is an alternate method for nucleic acid amplification. 
In LCR, probe pairs are used which include two primary (first and second) and two secondary 
(third and fourth) probes, all of which are employed in molar excess to target. The first probe 
hybridizes to a first segment of the target strand and the second probe hybridizes to a second 
5 segment of the target strand, the first and second segments being contiguous so that the 

primary probes abut one another in 5' phosphate^' hydroxyl relationship, and so that a ligase 
can covalently fuse or ligate the two probes into a fused product. In addition, a third 
(secondary) probe can hybridize to a portion of the first probe and a fourth (secondary) probe 
can hybridize to a portion of the second probe in a similar abutting fashion. Of course, if the 
1 0 target is initially double stranded, the secondary probes also will hybridize to the target 

complement in the first instance. Once the ligated strand of primary probes is separated from 
the target strand, it will hybridize with the third and fourth probes which can be ligated to form 
a complementary, secondary ligated product. It is important to realize that the ligated products 
are functionally equivalent to either the target or its complement. By repeated cycles of 

1 5 hybridization and ligation, amplification of the target sequence is achieved. This technique is 

described more completely in EP-A- 320 308 to K. Backman published June 16, 1989 and 
EP-A-439 182 to K. Backman et al, published July 31, 1991 , both of which are incorporated 
herein by reference. 

For amplification of mRNAs, it is within the scope of the present invention to reverse 

2 0 transcribe mRNA into cDNA followed by polymerase chain reaction (RT-PCR); or, to use a 

single enzyme for both steps as described in U.S. Patent No. 5,322,770, which is 
incorporated herein by reference; or reverse transcribe mRNA into cDNA followed by 
asymmetric gap ligase chain reaction (RT-AGLCR) as described by R. L. Marshall, et al., 
PCR Methods and Applications 4: 80-84 (1994), which also is incorporated herein by 

2 5 reference. 

Other known amplification methods which can be utilized herein include but are not 
limited to the so-called "NASBA" or n 3SR" technique described in PNAS USA 87: 1 874- 1 878 
(1990) and also described in Nature 350 (No. 6313):91-92 (1991); Q-beta amplification as 
described in published European Patent Application (EPA) No. 4544610; strand displacement 

3 0 amplification (as described in G. T. Walker et al., Clin. Chem . 42:9-13 [1 996]) and European 

Patent Application No. 684315; and target mediated amplification, as described by PCT 
Publication WO 9322461. 

In one embodiment, the present invention generally comprises the steps of contacting a 
test sample suspected of containing a target polynucleotide sequence with amplification 
3 5 reaction reagents comprising an amplification primer, and a detection probe that can hybridize 
with an internal region of the amplicon sequences. Probes and primers employed according to 
the method herein provided are labeled with capture and detection labels wherein probes are 
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labeled with one type of label and primers are labeled with the other type of label. 
Additionally, the primers and probes are selected such that the probe sequence has a lower melt 
temperature than the primer sequences. The amplification reagents, detection reagents and test 
sample are placed under amplification conditions whereby, in the presence of target sequence, 
5 copies of the target sequence (an amplicon) are produced. In the usual case, the amplicon is 
double stranded because primers are provided to amplify a target sequence and its 
complementary strand. The double stranded amplicon then is thermally denatured to produce 
single stranded amplicon members. Upon formation of the single stranded amplicon 
members, the mixture is cooled to allow the formation of complexes between the probes and 
1 0 single stranded amplicon members. 

As the single stranded amplicon sequences and probe sequences are cooled, the probe 
sequences preferentially bind the single stranded amplicon members. This finding is 
counterintuitive given that the probe sequences are generally selected to be shorter than the 
primer sequences and therefore have a lower melt temperature than the primers. Accordingly, 

1 5 the melt temperature of the amplicon produced by the primers should also have a higher melt 

temperature than the probes. Thus, as the mixture is cooled, the re-formation of the double 
stranded amplicon is expected although as previously stated, this is not the case. Instead, the 
probes are found to preferentially bind the single stranded amplicon members. Moreover, this 
preference of probe/single stranded amplicon binding exists even when the primer sequences 

2 0 are added in excess of the probes. 

After the probe/single stranded amplicon member hybrids are formed, they are 
detected. Standard heterogeneous assay formats are suitable for detecting the hybrids using 
the detection labels and capture labels present on the primers and probes. The hybrids can be 
bound to a solid phase reagent by virtue of the capture label and detected by virtue of the 

2 5 detection label. In cases where the detection label is directly detectable, the presence of the 

hybrids on the solid phase can be detected by causing the label to produce a detectable signal, 
if necessary, and detecting the signal. In cases where the label is not directly detectable, the 
captured hybrids can be contacted with a conjugate, which generally comprises a binding 
member attached to a directly detectable label. The conjugate becomes bound to the complexes 

3 0 and the conjugates presence on the complexes can be detected with the directly detectable label. 

Thus, the presence of the hybrids on the solid phase reagent can be determined. Those skilled 
in the art will recognize that wash steps may be employed to wash away unhybridized 
amplicon or probe as well as unbound conjugate. 

A test sample is typically anything suspected of containing a target sequence. Test 
3 5 samples can be prepared using methodologies well known in the art such as by obtaining a 

specimen from an individual and, if necessary, disrupting any cells contained therein to release 
target nucleic acids. Although the target sequence is described as single stranded, it also is 
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contemplated to include the case where the target sequence is actually double stranded but is 
merely separated from its complement prior to hybridization with the amplification primer 
sequences. In the case where PGR is employed in this method, the ends of the target 
sequences are usually known. In cases where LCR or a modification thereof is employed in 
5 the preferred method, the entire target sequence is usually known. Typically, the target 
sequence is a nucleic acid sequence such as, for example, RNA or DNA. 

The method provided herein can include well known amplification reactions that utilize 
thermal cycle reaction mixtures, particularly PCR and GLCR. Amplification reactions 
typically employ primers to repeatedly generate copies of a target nucleic acid sequence, which 
1 0 target sequence is usually a small region of a much larger nucleic acid sequence. Primers are 
themselves nucleic acid sequences that are complementary to regions of a target sequence. 
Under amplification conditions, these primers hybridize or bind to the complementary regions 
of the target sequence. Copies of the target sequence typically are generated by the process of 
primer extension and/or ligation which utilizes enzymes with polymerase or ligase activity, 

1 5 separately or in combination, to add nucleotides to the hybridized primers and/or ligate adjacent 

probe pairs. The nucleotides that are added to the primers or probes, as monomers or 
preformed oligomers, are also complementary to the target sequence. Once the primers or 
probes have been sufficiently extended and/or ligated they are separated from the target 
sequence, for example, by heating the reaction mixture to a "melt temperature" which is one in 

2 0 which complementary nucleic acid strands dissociate. Thus, a sequence complementary to the 

target sequence is formed. 

A new amplification cycle then can take place to further amplify the number of target 
sequences by separating any double stranded sequences, allowing primers or probes to 
hybridize to their respective targets, extending and/or ligating the hybridized primers or probes 

2 5 and re-separating. The complementary sequences that are generated by amplification cycles 

can serve as templates for primer extension or filling the gap of two probes to further amplify 
the number of target sequences. Typically, a reaction mixture is cycled between 20 and 100 
times, more typically, a reaction mixture is cycled between 25 and 50 times. The numbers of 
cycles can be determined by the routineer. In this manner, multiple copies of the target 

3 0 sequence and its complementary sequence are produced. Thus, primers initiate amplification 

of the target sequence when it is present under amplification conditions. 

Generally, two primers which are complementary to a portion of a target strand and its 
complement are employed in PCR. For LCR, four probes, two of which are complementary 
to a target sequence and two of which are similarly complementary to the targets complement, 
3 5 are generally employed. In addition to the primer sets and enzymes previously mentioned, a 
nucleic acid amplification reaction mixture may also comprise other reagents which are well 
known and include but are not limited to: enzyme cofactors such as manganese; magnesium; 
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salts; nicotinamide adenine dinucleotide (NAD); and deoxynucleotide triphosphates (dNTPs) 
such as for example deoxyadenine triphosphate, deoxyguanine triphosphate, deoxycytosine 
triphosphate and deoxythymine triphosphate. 

While the amplification primers initiate amplification of the target sequence, in some 
5 cases, the detection (or hybridization) probe is not involved in amplification. Detection probes 
are generally nucleic acid sequences or uncharged nucleic acid analogs such as, for example, 
peptide nucleic acids which are disclosed in International Patent Application WO 92/20702; 
morpholino analogs which are described in U.S. Patents Nos 5,185,444, 5,034,506, and 
5,142,047; and the like. Depending upon the type of label carried by the probe, the probe is 
1 0 employed to capture or detect the amplicon generated by the amplification reaction. The probe 
is not involved in amplification of the target sequence and therefore may have to be rendered 
"non-extendable" in that additional dNTPs cannot be added to the probe. In and of themselves 
analogs usually are non-extendable and nucleic acid probes can be rendered non-extendable by 
modifying the 3' end of the probe such that the hydroxyl group is no longer capable of 

1 5 participating in elongation. For example, the 3' end of the probe can be functionalized with the 

capture or detection label to thereby consume or otherwise block the hydroxyl group. 
Alternatively, the 3' hydroxyl group simply can be cleaved, replaced or modified. U.S. Patent 
Application Serial No. 07/049,061 filed April 19, 1993 and incorporated herein by reference 
describes modifications which can be used to render a probe non-extendable. 

2 0 Accordingly, the ratio of primers to probes is not important. Thus, either the probes or 

primers can be added to the reaction mixture in excess whereby the concentration of one would 
be greater than the concentration of the other. Alternatively, primers and probes can be 
employed in equivalent concentrations. Preferably, however, the primers are added to the 
reaction mixture in excess of the probes. Thus, primer to probe ratios of, for example, 5: 1 and 

2 5 20:1 are preferred. 

While the length of the primers and probes can vary, the probe sequences are selected 
such that they have a lower melt temperature than the primer sequences. Hence, the primer 
sequences are generally longer than the probe sequences. Typically, the primer sequences are 
in the range of between 20 and 50 nucleotides long, more typically in the range of between 20 

3 0 and 30 nucleotides long. The typical probe is in the range of between 10 and 25 nucleotides 

long. 

Alternatively, a probe may. be involved in the amplifying a target sequence, via a 
process known as "nested PCR". In nested PCR, the probe has characteristics which are 
similar to those of the first and second primers normally used for amplification (such as length, 
3 5 melting temperature etc.) and as such, may itself serve as a primer in an amplification reaction. 
Generally in nested PCR, a first pair of primers (Pi and P2) are employed to form primary 
extension products. One of the primary primers (for example, Pi) may optionally be a capture 
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primer (i.e. linked to a member of a first reactive pair), whereas the other primary primer (P2) 
is not. A secondary extension product is then formed using the Pj primer and a probe (P2O 
which may also have a capture type label (such as a member of a second reactive pair) or a 
detection label at its 5' end. The probe is complimentary to and hybridizes at a site on the 
5 template near or adjacent (but not overlapping) the site where the 3' terminus of P2 would 
hybridize if it was still in solution. Thus, a labeled primer/probe set generates a secondary 
product which is shorter than the primary extension product. Furthermore, the secondary 
product may be detected either on the basis of its size or via its labeled ends (by detection 
methodologies well known to those of ordinary skill in the art). In this process, probe and 
1 0 primers are generally employed in equivalent concentrations. 

Various methods for synthesizing primers and probes are well known in the art. 
Similarly, methods for attaching labels to primers or probes are also well known in the art. 
For example, it is a matter of routine to synthesize desired nucleic acid primers or probes using 
conventional nucleotide phosphoramidite chemistry and instruments available from Applied 

1 5 Biosystems, Inc., (Foster City, CA), Dupont (Wilmington, DE), or Milligen (Bedford MA). 

Many methods have been described for labeling oligonucleotides such as the primers or probes 
of the present invention. Enzo Biochemical (New York, NY) and Clontech (Palo Alto, CA) 
both have described and commercialized probe labeling techniques. For example, a primary 
amine can be attached to a 3' oligo terminus using 3'-Amine-ON CPG™ (Clontech, Palo Alto, 

2 0 CA). Similarly, a primary amine can be attached to a 5' oligo terminus using Aminomodifier 

II® (Clontech). The amines can be reacted to various haptens using conventional activation 
and linking chemistries. In addition, copending applications US. Serial Nos. 625,566, filed 
December 1 1, 1990 and 630,908, filed December 20, 1990, which are each incorporated 
herein by reference, teach methods for labeling probes at their 5' and 3' termini, respectively. 

2 5 Publications W092/1 0505, published 25 June 1992 and WO 92/1 1 388 published 9 July 1992 

teach methods for labeling probes at their 5' and 3' ends, respectively. According to one 
known method for labeling an oligonucleotide, a label-phosphoramidite reagent is prepared 
and used to add the label to the oligonucleotide during its synthesis. See, for example, N.T. 
Thuong et al., Tet. Letters 29(46): 5905-5908 (1988); or J. S. Cohen et al., published U.S. 

3 0 Patent Application 07/246,688 (NTIS ORDER No. PAT-APPL-7-246,688) (1989). 

Preferably, probes are labeled at their 3' and 5' ends. 

Capture labels are carried by the primers or probes and can be a specific binding 
member which forms a binding pair with the solid phase reagent's specific binding member. It 
will be understood, of course that the primer or probe itself may serve as the capture label. 
3 5 For example, in the case where a solid phase reagent's binding member is a nucleic acid 

sequence, it may be selected such that it binds a complementary portion of the primer or probe 
to thereby immobilize the primer or probe to the solid phase. In cases where the probe itself 
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serves as the binding member, those skilled in the art will recognize that the probe will contain 
a sequence or "tail" that is not complementary to the single stranded amplicon members. In the 
case where the primer itself serves as the capture label, at least a portion of the primer will be 
free to hybridize with a nucleic acid on a solid phase because the probe is selected such that it 
5 is not fully complementary to the primer sequence. 

Generally, probe/single stranded amplicon member complexes can be detected using 
techniques commonly employed to perform heterogeneous immunoassays. Preferably, in this 
embodiment, detection is performed according to the protocols used by the commercially 
available Abbott LCx® instrumentation (Abbott Laboratories, Abbott Park, IL). 
1 0 The primers and probes disclosed herein are useful in typical PCR assays, wherein the 

test sample is contacted with a pair of primers, amplification is performed, the hybridization 
probe is added, and detection is performed. 

Another method provided by the present invention comprises contacting a test sample 
with a plurality of polynucleotides wherein at least one polynucleotide is provided herein, 

1 5 hybridizing the test sample with the plurality of polynucleotides and detecting the hybridization 

complexes. The hybridization complexes are identified and quantitated to compile a profile 
which is indicative of cancer, metastases, arthritis or inflammatory conditions. Expressed 
RNA sequences may further be detected by reverse transcription and amplification of the DNA 
product by procedures well-known in the art, including polymerase chain reaction (PCR). 

2 0 Antisense and Gene Therapy . 

The present invention also encompasses the use of gene therapy methods for the 
introduction of anti-sense MMP19 gene derived molecules such as polynucleotides or 
oligonucleotides of the present invention into patients having conditions (such as cancer, 
arthritis or inflammation) that are associated with bnormal expression of MMP19 

2 5 polynucleotides. These molecules, including antisense RNA and DNA fragments and 

ribozymes, are designed to inhibit the translation of a MMP19 polynucleotide mRNA, and may 
be used therapeutically in the treatment of conditions associated with altered or abnormal 
expression of the MMP19 polynucleotide. 

Alternatively, the oligonucleotides described above can be delivered to cells by 

3 0 procedures in the art such that the anti-sense RNA or DNA is subsequently expressed in vivo 

whereby it inhibits production of MMP19 polypeptide in the manner described above. 
Antisense constructs to MMP 19 polynucleotide, therefore, reverse the action of MMP19 
transcripts and may be used for treating cancer, arthritis, inflammation and related disease 
states. These antisense constructs may also be used to treat tumor metastases. 
3 5 Antisense technology can be used to control gene expression through triple-helix 

formation or antisense DNA or RNA, both of which methods are based on binding of a 
polynucleotide to DNA or RNA. For example, the 5' coding portion of the polynucleotide 
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sequence, which encodes for the polypeptide of the present invention, is used to design an 
antisense RNA oligonucleotide of from 10 to 40 base pairs in length. A DNA oligonucleotide 
is designed to be complementary to a region of the gene involved in transcription, thereby 
preventing transcription and the production of MMP19 polypeptide. For triple helix, see, for 
example, Lee et al, Nucl. Acids Res . 6:3073 (1979); Cooney et a], Science 241:456 (1988); 
and Dervan et al, Science 251:1360 (1991) The antisense RNA oligonucleotide hybridizes to 
the mRNA in vivo and blocks translation of an mRNA molecule into the MMP19 polypeptide. 
For antisense, see, for example, Okano, J. Neurochem . 56:560 (1991); and 
"Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression", CRC Press, Boca 
Raton, Fla. (1988). Antisense oligonucleotides act with greater efficacy when modified to 
contain artificial internucleotide linkages which render the molecule resistant to nucleolytic 
cleavage. Such artificial internucleotide linkages include but are not limited to 
methylphosphonate, phosphorothiolate and phosphoroamydate internucleotide linkages. 

The present invention also encompasses gene therapy whereby the gene encoding 
MMP19 is regulated in a patient. Various methods of transferring or delivering DNA to cells 
for expression of the gene product protein, otherwise referred to as gene therapy, are disclosed 
in "Gene Transfer into Mammalian Somatic Cells in vivo", N. Yang, Crit. Rev. Biotechn . 
12(4): 335-356 (1992), which is hereby incorporated by reference. Gene therapy 
encompasses incorporation of DNA sequences into somatic cells or germ line cells for use in 
either ex vivo or in vivo therapy. Gene therapy functions to replace genes, augment normal or 
abnormal gene function, and to combat infectious diseases and other pathologies. 

Strategies for treating medical problems with gene therapy include therapeutic strategies 
such as identifying the defective gene and then adding a functional gene to either replace the 
function of the defective gene or to augment a slightly functional gene; or prophylactic 
strategies, such as adding a gene which encodes a protein product that will treat the condition 
or that will make the tissue or organ more susceptible to a treatment regimen. 

Many protocols for transfer of MMP19 DNA or MMP19 regulatory sequences are 
envisioned in this invention. Transfection of promoter sequences, other than one specifically 
associated with MMP19, or other sequences which would increase production of MMP19 
protein are also envisioned as methods of gene therapy. An example of this technology is 
found in Transkaryotic Therapies, Inc., of Cambridge, Massachusetts, using homologous 
recombination to insert a "genetic switch" that turns on an erythropoietin gene in cells. See 
Genetic Engineering News . April 15, 1994. Such "genetic switches" could be used to activate 
MMP19 (or a MMP19 receptor) in cells not normally expressing these proteins. 

Gene transfer methods for gene therapy fall into three broad categories: (1) physical 
(e.g., electroporation, direct gene transfer and particle bombardment), (2) chemical (e.g. lipid- 
based carriers and other non-viral vectors) and (3) biological (e.g. virus derived vectors). For 
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example, non-viral vectors such as liposomes coated with DNA may be directly injected 
intravenously into the patient. It is believed that the liposome/DNA complexes are 
concentrated in the liver where they deliver the DNA to macrophages and Kupffer cells. 
Additionally, vectors or the "naked" DNA of the gene may be directly injected into the desired 
5 organ, tissue or tumor for targeted delivery of the therapeutic DNA. 

Gene therapy methodologies can also be described by delivery site. Fundamental ways 
to deliver genes include ex vivo gene transfer, in vivo gene transfer, and in vitro gene transfer. 
In ex vivo gene transfer, cells are taken from the patient and grown in cell culture. The DNA 
is transfected into the cells, the transfected cells are expanded in number and then reimplanted 
10 in the patient. In in vitro gene transfer, the transformed cells are cells growing in culture, such 
as tissue culture cells, and not particular cells from a particular patient. These "laboratory 
cells" are transfected, the transfected cells are selected and expanded for either implantation 
into a patient or for other uses. In vivo gene transfer involves introducing the DNA into the 
cells of the patient when the cells are within the patient. All three of the broad based categories 

1 5 described above may be used to achieve gene transfer in vivo y ex vivo, and in vitro. 

Mechanical (i.e. physical) methods of DNA delivery can be achieved by direct injection 
of DNA, such as microinjection of DNA into germ or somatic cells, pneumatically delivered 
DNA-coated particles, such as the gold particles used in a "gene gun," and inorganic chemical 
approaches such as calcium phosphate transfection. It has been found that physical injection 

2 0 of plasmid DNA into muscle cells yields a high percentage of cells which are transfected and 

have a sustained expression of marker genes. The plasmid DNA may or may not integrate into 
the genome of the cells. Non-integration of the transfected DNA would allow the transfection 
and expression of gene product proteins in terminally differentiated, non-proliferative tissues 
for a prolonged period of time without fear of mutational insertions, deletions, or alterations in 

2 5 the cellular or mitochondrial genome. Long-term, but not necessarily permanent, transfer of 

therapeutic genes into specific cells may provide treatments for genetic diseases or for 
prophylactic use. The DNA could be reinjected periodically to maintain the gene product level 
without mutations occurring in the genomes of the recipient cells. Non-integration of 
exogenous DNAs may allow for the presence of several different exogenous DNA constructs 

3 0 within one cell with all of the constructs expressing various gene products. 

Particle-mediated gene transfer may also be employed for injecting DNA into cells, 
tissues and organs. With a particle bombardment device, or "gene gun," a motive force is 
generated to accelerate DNA-coated high density particles (such as gold or tungsten) to a high 
velocity that allows penetration of the target organs, tissues or cells. Electroporation for gene 
3 5 transfer uses an electrical current to make cells or tissues susceptible to electroporation- 

mediated gene transfer. A brief electric impulse with a given field strength is used to increase 
the permeability of a membrane in such a way that DNA molecules can penetrate into the cells. 
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The techniques of particle-mediated gene transfer and electroporation are well known to those 
of ordinary skill in the art. 

Chemical methods of gene therapy involve carrier mediated gene transfer through the 
use of fusogenic lipid vesicles such as liposomes or other vesicles for membrane fusion. A 
5 carrier harboring a DNA of interest can be conveniently introduced into body fluids or the 
bloodstream and then site specifically directed to the target organ or tissue in the body. 
Liposomes, for example, can be developed which are cell specific or organ specific. The 
foreign DNA carried by the liposome thus will be taken up by those specific cells. Injection of 
immunoliposomes that are targeted to a specific receptor on certain cells can be used as a 
1 0 convenient method of inserting the DNA into the cells bearing the receptor. Another carrier 
system that has been used is the asialoglycoprotein/polylysine conjugate system for carrying 
DNA to hepatocytes for in vivo gene transfer. 

Transfected DNA may also be complexed with other kinds of carriers so that the DNA 
is carried to the recipient cell and then resides in the cytoplasm or in the nucleoplasm of the 

1 5 recipient cell. DNA can be coupled to carrier nuclear proteins in specifically engineered vesicle 

complexes and carried directly into the nucleus. 

Carrier mediated gene transfer may also involve the use of lipid-based proteins which 
are not liposomes. For example, lipofectins and cytofectins are lipid-based positive ions that 
bind to negatively charged DNA, forming a complex that can ferry the DNA across a cell 

2 0 membrane. Another method of carrier mediated gene transfer involves receptor-based 

endocytosis. In this method, a ligand (specific to a cell surface receptor) is made to form a 
complex with a gene of interest and then injected into the bloodstream; target cells that have the 
cell surface receptor will specifically bind the ligand and transport the ligand-DNA complex 
into the cell. 

2 5 Biological gene therapy methodologies usually employ viral vectors to insert genes into 

cells. The term "vector" as used herein in the context of biological gene therapy means a 
carrier that can contain or associate with specific polynucleotide sequences and which 
functions to transport the specific polynucleotide sequences into a cell. The transfected cells 
may be cells derived from the patient's normal tissue, the patient's diseased tissue, or may be 

3 0 non-patient cells. Examples of vectors include plasmids and infective microorganisms such as 

viruses, or non-viral vectors such as the ligand-DNA conjugates, liposomes, and lipid-DNA 
complexes discussed above. 

It may be desirable that a recombinant DNA molecule comprising a MMP19 DNA 
sequence is operatively linked to an expression control sequence to form an expression vector 
3 5 capable of expressing MMP19. Alternatively, gene regulation of MMP19 may be 

accomplished by administering proteins that bind to control regions associated with the 
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MMP19 gene, or its corresponding RNA transcript to modify the rate of transcription or 
translation. 

Viral vectors that have been used for gene therapy protocols include but are not limited 
to, retroviruses, other RNA viruses such as poliovirus or Sindbis virus, adenovirus, adeno- 
5 associated virus, herpes viruses, SV 40, vaccinia and other DNA viruses. Replication- 
defective murine retroviral vectors are the most widely utilized gene transfer vectors. Murine 
leukemia retroviruses are composed of a single strand RNA complexed with a nuclear core 
protein and polymerase (pol) enzymes, encased by a protein core (gag) and surrounded by a 
glycoprotein envelope (env) that determines host range. The genomic structure of retroviruses 
1 0 include the gag, pol, and env genes enclosed at by 5' and 3' long terminal repeats (LTR). 

Retroviral vector systems exploit the fact that a minimal vector containing the 5* and 3' LTRs 
and the packaging signal are sufficient to allow vector packaging, infection and integration into 
target cells providing that the viral structural proteins are supplied in trans in the packaging cell 
line. Fundamental advantages of retroviral vectors for gene transfer include efficient infection 

1 5 and gene expression in most cell types, precise single copy vector integration into target cell 

chromosomal DNA, and ease of manipulation of the retroviral genome. For example, altered 
retrovirus vectors have been used in ex vivo methods to introduce genes into peripheral and 
tumor-infiltrating lymphocytes, hepatocytes, epidermal cells, myocytes, or other somatic cells 
(which may then introduced into the patient to provide the gene product from the inserted 

2 0 DNA). 

The adenovirus is composed of linear, double stranded DNA complexed with core 
proteins and surrounded with capsid proteins. Advances in molecular virology have led to the 
ability to exploit the biology of these organisms to create vectors capable of transducing novel 
genetic sequences into target cells in vivo . Adenoviral-based vectors will express gene 

2 5 product peptides at high levels. Adenoviral vectors have high efficiencies of infectivity, even 

with low titers of virus. Additionally, the virus is fully infective as a cell free virion so 
injection of producer cell lines are not necessary. Another potential advantage to adenoviral 
vectors is the ability to achieve long term expression of heterologous genes in vivo. 

Viral vectors have also been used to insert genes into cells using in vivo protocols. To 

3 0 direct tissue-specific expression of foreign genes, cis-acting regulatory elements or promoters 

that are known to be tissue specific can be used. Alternatively, this can be achieved using in 
situ delivery of DNA or viral vectors to specific anatomical sites in vivo. For example, gene 
transfer to blood vessels in vivo was achieved by implanting in vitro transduced endothelial 
cells in chosen sites on arterial walls. The virus infected surrounding cells which also 
3 5 expressed the gene product. A viral vector can be delivered directly to the in vivo site, by a 
catheter for example, thus allowing only certain areas to be infected by the virus, and 
providing long-term, site specific gene expression. In vivo gene transfer using retrovirus 
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vectors has also been demonstrated in mammary tissue and hepatic tissue by injection of the 
altered virus into blood vessels leading to the organs. 
Drug Screening . 

The present invention also provides a method of screening a plurality of compounds 
5 for specific binding to a MMP19 polypeptide, or any fragment thereof, to identify at least one 
compound which specifically binds the MMP19 polypeptide. Such a method comprises the 
steps of providing at least one compound; combining the MMP19 polypeptide with each 
compound under suitable conditions for a time sufficient to allow binding; and detecting 
MMP1 9 polypeptide binding to each compound. 
1 0 The present invention also provides a method of screening a plurality of compounds 

for inhibition of the activity of a MMP19 polypeptide, or any fragment thereof, to identify at 
least one compound which specifically inhibits the activity of the MMP19 polypeptide. Such a 
method comprises the steps of providing at least one compound; combining the MMP19 
polypeptide with each compound under suitable conditions for a time sufficient to allow 

1 5 interaction or binding; and detecting MMP19 polypeptide inhibition by each compound. 

The polypeptide or peptide fragment employed in such a test may either be free in 
solution, affixed to a solid support, borne on a cell surface or located intracellularly. One 
method of drug screening utilizes eukaryotic or prokaryotic host cells which are stably 
transformed with recombinant nucleic acids which can express the polypeptide or peptide 

2 0 fragment. Drugs may be screened against such transformed cells in competitive binding or 

enzymatic inhibition assays. For example, the formation of complexes between a polypeptide 
and the agent being tested can be measured in either viable or fixed cells. 

The present invention thus provides methods of screening for drugs or any other agent 
which can be used to treat conditions associated with cancer, arthritis, or inflammation 

2 5 resulting from abnormal MMP19 production. These methods comprise contacting the drug 

with a polypeptide or fragment thereof and assaying for either the presence of a complex 
between the agent and the polypeptide, or for the presence of a complex between the 
polypeptide and the cell. In competitive binding assays, the polypeptide typically is labeled. 
After suitable incubation, free (or uncomplexed) polypeptide or fragment thereof is separated 

3 0 from that present in bound form, and the amount of free or uncomplexed label is a measure of 

the ability of the particular drug to bind to polypeptide or to interfere with the polypeptide/cell 
complex. 

The present invention also encompasses the use of competitive drug screening assays 
in which neutralizing antibodies capable of binding polypeptide specifically compete with a test 
3 5 drug for binding to the polypeptide or fragment thereof. In this manner, the antibodies can be 
used to detect the presence of any polypeptide in the test sample which shares one or more 
antigenic determinants with a polypeptide provided herein. 
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Another technique for drug screening provides high throughput screening for 
compounds having suitable binding affinity to at least one polypeptide disclosed herein. 
Briefly, large numbers of different small peptide test compounds are synthesized on a solid 
phase, such as plastic pins or some other surface. The peptide test compounds are reacted 
5 with polypeptide and washed. Polypeptide thus bound to the solid phase is detected by 

methods well-known in the art. Purified polypeptide can also be coated directly onto plates for 
use in the drug screening techniques described herein. In addition, non-neutralizing antibodies 
can be used to capture the polypeptide and immobilize it on the solid support. See, for 
example, EP 84/03564, published on September 13, 1984, which is incorporated herein by 
1 0 reference. 

Drug Design 

The goal of rational drug design is to produce structural analogs of biologically active 
polypeptides of interest or of the small molecules including agonists, antagonists, or inhibitors 
with which they interact. Such structural analogs can be used to fashion drugs which are more 

1 5 active or stable forms of the polypeptide or which enhance or interfere with the function of a 

polypeptide in vivo. J. Hodgson, Bio/Technology 9:19-21 (1991), incorporated herein by 
reference. 

For example, in one approach, the three-dimensional structure of a polypeptide, or of a 
polypeptide-inhibitor complex, is determined by x-ray crystallography, by computer modeling 

2 0 or, most typically, by a combination of the two approaches. Both the shape and charges of the 

polypeptide must be ascertained to elucidate the structure and to determine active site(s) of the 
molecule. Less often, useful information regarding the structure of a polypeptide may be 
gained by modeling based on the structure of homologous proteins. In both cases, relevant 
structural information is used to design analogous polypeptide-like molecules or to identify 

2 5 efficient inhibitors 

Useful examples of rational drug design may include molecules which have improved 
activity or stability as shown by S. Braxton et al., Biochemistry 31 : 7796-7801 (1992), or 
which act as inhibitors, agonists, or antagonists of native peptides as shown by S. B. P. 
Athauda et al., J Biochem. (Tokvo) 113 (6):742-746 (1993), incorporated herein by reference. 

3 0 It also is possible to isolate a target-specific antibody, selected by an assay as described 

hereinabove, and then to determine its crystal structure. In principle this approach yields a 
pharmacophore upon which subsequent drug design can be based. It further is possible to 
bypass protein crystallography altogether by generating anti-idiotypic antibodies ("anti-ids") to 
a functional, pharmacologically active antibody. As a mirror image of a mirror image, the 
3 5 binding site of the anti-id is an analog of the original receptor. The anti-id then could be used 
to identify and isolate peptides from banks of chemically or biologically produced peptides. 
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The isolated peptides then can act as the pharmacophore (that is, a prototype pharmaceutical 
drug). 

A sufficient amount of a recombinant polypeptide of the present invention may be made 
available to perform analytical studies such as X-ray crystallography. In addition, knowledge 
5 of the polypeptide amino acid sequence which are derivable from the nucleic acid sequence 
provided herein will provide guidance to those employing computer modeling techniques in 
place of or in addition to x-ray crystallography. 

Antibodies specific to the MMP19 polypepeptide may further be used to inhibit the 
biological action of the polypeptide by binding to the polypeptide. In this manner, the 
1 0 antibodies may be used in therapy, for example, to treat cancer and its metastases. 

Further, such antibodies can detect the presence or absence of the MMP19 polypeptide 
and, therefore, are useful as diagnostic markers for the diagnosis of cancers, arthritis, and 
inflammatory conditions. Such antibodies may also function as a diagnostic marker for these 
conditions. The present invention also is directed to antagonists and inhibitors of the 

1 5 polypeptides of the present invention. The antagonists and inhibitors are those which inhibit 

or eliminate the function of the polypeptide. Thus, for example, an antagonist may bind to a 
polypeptide of the present invention and inhibit or eliminate its function. The antagonist, for 
example, could be an antibody against the polypeptide which eliminates the activity of the 
MMP 19 polypeptide by binding to the MMP19 polypeptide, or in some cases the antagonist 

2 0 may be an oligonucleotide. Examples of small molecule inhibitors include but are not limited 

to small peptides or peptide-like molecules. 

The antagonists and inhibitors may be employed as a composition with a 
pharmaceutically acceptable carrier, including but not limited to saline, buffered saline, 
dextrose, water, glycerol, ethanol and combinations thereof. Administration of MMP19 

2 5 polypeptide inhibitors are preferably systemic. The present invention also provides an 

antibody which inhibits the action of such polypeptide. 
Recombinant Technology . 

The present invention provides host cells and expression vectors comprising 
polynucleotides of the present invention and methods for the production of polypeptides they 

3 0 encode. Such methods comprise culturing the host cells under conditions suitable for the 

expression of the MMP1 9 polynucleotide and recovering the MMP19 polypeptide from the cell 
culture. 

The present invention also provides vectors which include polynucleotides of the 
present invention, host cells which are genetically engineered with vectors of the present 
3 5 invention and the production of polypeptides of the present invention by recombinant 
techniques. 
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Host cells are genetically engineered (transduced or transformed or transfected) with 
the vectors of this invention which may be a cloning vector or an expression vector. The 
vector may be in the form of a plasmid, a viral particle, a phage, etc. The engineered host cells 
can be cultured in conventional nutrient media modified as appropriate for activating 
5 promoters, selecting transformants or amplifying the MMP19 genes. The culture conditions, 
such as temperature, pH and the like, are those previously used with the host cell selected for 
expression, and will be apparent to the ordinarily skilled artisan. 

The polynucleotide of the present invention may be employed for producing a 
polypeptide by recombinant techniques. Thus, the polynucleotide sequence may be included 
10 in any one of a variety of expression vehicles, in particular vectors or plasmids for expressing 
a polypeptide. Such vectors include chromosomal, nonchromosomal and synthetic DNA 
sequences, e.g., derivatives of SV40; bacterial plasmids; phage DNA; yeast plasmids; vectors 
derived from combinations of plasmids and phage DNA, viral DNA such as vaccinia, 
adenovirus, fowl pox virus, and pseudorabies. However, any other plasmid or vector may be 

1 5 used so long as it is replicable and viable in the host. 

The appropriate DNA sequence may be inserted into the vector by a variety of 
procedures. In general, the DNA sequence is inserted into appropriate restriction endonuclease 
sites by procedures known in the art. Such procedures and others are deemed to be within the 
scope of those skilled in the art. The DNA sequence in the expression vector is operatively 

2 0 linked to an appropriate expression control sequence(s) (promoter) to direct mRNA synthesis. 

Representative examples of such promoters include but are not limited to LTR or SV40 
promoter, the E. coli lac or trp, the phage lambda P sub L promoter and other promoters 
known to control expression of genes in prokaryotic or eukaryotic cells or their viruses. The 
expression vector also contains a ribosome binding site for translation initiation and a 

2 5 transcription terminator. The vector may also include appropriate sequences for amplifying 

expression. In addition, the expression vectors preferably contain a gene to provide a 
phenotypic trait for selection of transformed host cells such as dihydrofolate reductase or 
neomycin resistance for eukaryotic cell culture, or such as tetracycline or ampicillin resistance 
in E. coli. 

3 0 The vector containing the appropriate DNA sequence as hereinabove described, as well 

as an appropriate promoter or control sequence, may be employed to transform an appropriate 
host to permit the host to express the protein. As representative examples of appropriate hosts, 
there may be mentioned: bacterial cells, such as E. coli . Salmonella typhimurium : 
Streptomvces sp. : fungal cells, such as yeast; insect cells such as Drosophila and Sf9; animal 
3 5 cells such as CHO, COS or Bowes melanoma; plant cells, etc. The selection of an appropriate 
host is deemed to be within the scope of those skilled in the art from the teachings provided 
herein. 
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More particularly, the present invention also includes recombinant constructs 
comprising one or more of the sequences as broadly described above. The constructs 
comprise a vector, such as a plasmid or viral vector, into which a sequence of the invention 
has been inserted, in a forward or reverse orientation. In a preferred aspect of this 
5 embodiment, the construct further comprises regulatory sequences, including, for example, a 
promoter, operably linked to the sequence. Large numbers of suitable vectors and promoters 
are known to those of skill in the art, and are commercially available. The following vectors 
are provided by way of example. Bacterial: pSPORTl (Life Technologies, Gaithersburg, 
MD), pQE70, pQE60, pQE-9 (Qiagen) pBs, phagescript, psiX174, pBluescript SK, pBsKS, 
1 0 pNH8a, pNH16a, pNH18a, pNH46a (Stratagene); pTrc99A, pKK223-3, pKK233-3, 
pDR540, pRIT5 (Pharmacia). Eukaryotic: pWLneo, pSV2cat, pOG44, pXTl, pSG 
(Stratagene) pSVK3, pBPV, pMSG, pSVL (Pharmacia). However, any other plasmid or 
vector may be used as long as it is replicable and viable in the host. 

Promoter regions can be selected from any desired gene using CAT (chloramphenicol 

1 5 transferase) vectors or other vectors with selectable markers. Two appropriate vectors are 

pKK232-8 and pCM7. Particular named bacterial promoters include lad, lacZ, T3, SP6, T7, 
gpt, lambda P sub R, P sub L and trp. Eukaryotic promoters include cytomegalovirus (CMV) 
immediate early, herpes simplex virus (HSV) thymidine kinase, early and late SV40, LTRs 
from retrovirus, and mouse metallothionein-L Selection of the appropriate vector and 

2 0 promoter is well within the level of ordinary skill in the art. 

In a further embodiment, the present invention provides host cells containing the 
above-described construct. The host cell can be a higher eukaryotic cell, such as a mammalian 
cell, or a lower eukaryotic cell, such as a yeast cell, or the host cell can be a prokaryotic cell, 
such as a bacterial cell. Introduction of the construct into the host cell can be effected by 

2 5 calcium phosphate transfection, DEAE-Dextran mediated transfection, or electroporation (L. 

Davis et al., "Basic Methods in Molecular Biology", 2nd edition, Appleton and Lang, 
Paramount Publishing, East Norwalk, CT (1994). 

The constructs in host cells can be used in a conventional manner to produce the gene 
product encoded by the recombinant sequence. Alternatively, the polypeptides of the invention 

3 0 can be synthetically produced by conventional peptide synthesizers. 

Proteins can be expressed in mammalian cells, yeast^ bacteria, or other cells under the 
control of appropriate promoters. Cell-free translation systems can also be employed to 
produce such proteins using RNAs derived from the DNA constructs of the present invention. 
Appropriate cloning and expression vectors for use with prokaryotic and eukaryotic hosts are 
3 5 described by Sambrook et al., Molecular Cloning: A Laboratory Manual , Second Edition, 
(Cold Spring Harbor, N.Y., 1989), which is hereby incorporated by reference. 
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Transcription of a DNA encoding the polypeptides of the present invention by higher 
eukaryotes is increased by inserting an enhancer sequence into the vector. Enhancers are cis- 
acting elements of DNA, usually about from 10 to 300 bp, that act on a promoter to increase its 
transcription. Examples include the SV40 enhancer on the late side of the replication origin 
5 (bp 100 to 270), a cytomegalovirus early promoter enhancer, a polyoma enhancer on the late 
side of the replication origin, and adenovirus enhancers. 

Generally, recombinant expression vectors will include origins of replication and 
selectable markers permitting transformation of the host cell, e.g., the ampicillin resistance 
gene of E. coli and S. cerevisiae TRP1 gene, and a promoter derived from a highly-expressed 
1 0 gene to direct transcription of a downstream structural sequence. Such promoters can be 

derived from operons encoding glycolytic enzymes such as 3-phosphoglycerate kinase (PGK), 
alpha factor, acid phosphatase, or heat shock proteins, among others. The heterologous 
structural sequence is assembled in appropriate phase with translation initiation and termination 
sequences, and preferably, a leader sequence capable of directing secretion of translated 

1 5 protein into the periplasmic space or extracellular medium. Optionally, the heterologous 

sequence can encode a fusion protein including an N-terminal identification peptide imparting 
desired characteristics, e.g., stabilization or simplified purification of expressed recombinant 
product. 

Useful expression vectors for bacterial use are constructed by inserting a structural 

2 0 DNA sequence encoding a desired protein together with suitable translation initiation and 

termination signals in operable reading phase with a functional promoter. The vector will 
comprise one or more phenotypic selectable markers and an origin of replication to ensure 
maintenance of the vector and to, if desirable, provide amplification within the host. Suitable 
prokaryotic hosts for transformation include E. coli. Bacillus subtilis . Salmonella typhimurium 

2 5 and various species within the genera Pseudomonas. Streptomyces . and Staphylococcus . 

although others may also be employed as a routine matter of choice. 

Useful expression vectors for bacterial use comprise a selectable marker and bacterial 
origin of replication derived from plasmids comprising genetic elements of the well-known 
cloning vector pBR322 (ATCC 37017). Other vectors include but are not limited to PKK223- 

3 0 3 (Pharmacia Fine Chemicals, Uppsala, Sweden) and GEM1 (Promega Biotec, Madison, WI). 

These pBR322 "backbone" sections are combined with an appropriate promoter and the 
structural sequence to be expressed. 

Following transformation of a suitable host strain and growth of the host strain to an 
appropriate cell density, the selected promoter is derepressed by appropriate means (e.g., 
3 5 temperature shift or chemical induction), and cells are cultured for an additional period. Cells 
are typically harvested by centrifugation, disrupted by physical or chemical means, and the 
resulting crude extract retained for further purification. Microbial cells employed in expression 
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of proteins can be disrupted by any convenient method, including freeze-thaw cycling, 
sonication, mechanical disruption, or use of cell lysing agents; such methods are well-known 
to the ordinary artisan. 

Various mammalian cell culture systems can also be employed to express recombinant 
5 protein. Examples of mammalian expression systems include the COS-7 lines of monkey 

kidney fibroblasts described by Gluzman, Cell 23:175 (1981), and other cell lines capable of 
expressing a compatible vector, such as the CI 27, 3T3, CHO, HeLa and BHK cell lines. 
Mammalian expression vectors will comprise an origin of replication, a suitable promoter and 
enhancer, and also any necessary ribosome binding sites, polyadenylation site, splice donor 
1 0 and acceptor sites, transcriptional termination sequences, and 5' flanking nontranscribed 

sequences. DNA sequences derived from the SV40 viral genome, for example, S V40 origin, 
early promoter, enhancer, splice, and polyadenylation sites may be used to provide the 
required nontranscribed genetic elements. Representative, useful vectors include pRc/CMV 
and pcDNA3 (available from Invitrogen, San Diego, CA). 

1 5 MMP19 polypeptide is recovered and purified from recombinant cell cultures by 

known methods including ammonium sulfate or ethanol precipitation, acid extraction, anion or 
cation exchange chromatography, phosphocellulose chromatography, hydrophobic interaction 
chromatography, hydroxyapatite chromatography or lectin chromatography. It is preferred to 
have low concentrations (approximately 0.1-5 mM) of calcium ion present during purification 

2 0 (Price, et al., J. Biol. Chem . 244:917 [1969]). Protein refolding steps can be used, as 

necessary, in completing configuration of the protein. Finally, high performance liquid 
chromatography (HPLC) can be employed for final purification steps. 

The polypeptides of the present invention may be naturally purified products expressed 
from a high expressing cell line, or a product of chemical synthetic procedures, or produced by 

2 5 recombinant techniques from a prokaryotic or eukaryotic host (for example, by bacterial, 

yeast, higher plant, insect and mammalian cells in culture). Depending upon the host 
employed in a recombinant production procedure, the polypeptides of the present invention 
may be glycosylated with mammalian or other eukaryotic carbohydrates or may be non- 
glycosylated. The polypeptides of the invention may also include an initial methionine amino 

3 0 acid residue. 

The starting plasmids can be constructed from available plasmids in accord with 
published, known procedures. In addition, equivalent plasmids to those described are known 
in the art and will be apparent to the ordinarily skilled artisan. 

The following is the general procedure for the isolation and analysis of cDNA clones. 
3 5 In a particular embodiment disclosed herein, an EST homologous to a portion of human 
MMPs was identified by comparison with a consensus sequence to human MMPs. 
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The cDNA insert was sequenced in entirety, analyzed in detail set forth in the Examples 
and is disclosed in the Sequence Listing as SEQ NO: 4. These polynucleotides may contain an 
entire open reading frame with or without associated regulatory sequences for a particular 
gene, or they may encode only a portion of the gene of interest. This is attributed to the fact 
5 that many genes are several hundred, and sometimes several thousand, bases in length and, 
with current technology, cannot be cloned in their entirety because of vector limitations, 
incomplete reverse transcription of the first strand, or incomplete replication of the second 
strand. Contiguous, secondary clones containing additional nucleotide sequence may be 
obtained using a variety of methods known to those of skill in the art. 

1 0 Methods for DNA sequencing are well known in the art. Conventional enzymatic 

methods employ DNA polymerase, Klenow fragment, Sequenase (US Biochemical Corp, 
Cleveland, OH) or Taq polymerase to extend DNA chains from an oligonucleotide primer 
annealed to the DNA template of interest. Methods have been developed for the use of both 
single-stranded and double-stranded templates. The chain termination reaction products may 
15 be electrophoresed on urea/polyacrylamide gels and detected either by autoradiography (for 
radionucleotide labeled precursors) or by fluorescence (for fluorescent-labeled precursors). 
Recent improvements in mechanized reaction preparation, sequencing and analysis using the 
fluorescent detection method have permitted expansion in the number of sequences that can be 
determined per day using machines such as the Applied Biosystems 377 DNA Sequencers 

2 0 (Applied Biosystems, Foster City, CA). 

The reading frame of the nucleotide sequence can be ascertained by several types of 
analyses. First, reading frames contained within the coding sequence can be analyzed for the 
presence of start codon ATG and stop codons TGA, TAA or TAG. Typically, one reading 
frame will continue throughout the major portion of a cDNA sequence while the other two 

2 5 reading frames tend to contain numerous stop codons. In such cases reading frame 

determination is straightforward. In other more difficult cases, further analysis is required. 

Algorithms have been created to analyze the occurrence of individual nucleotide bases 
at each putative codon triplet. See, for example J. W. Fickett, Nuc Acids Res 10:5303 
( 1 982). Coding DNA for particular organisms (bacteria, plants, and animals) tends to contain 

3 0 certain nucleotides within certain triplet periodicities, such as a significant preference for 

pyrimidines in the third codon position. These preferences have been incorporated into widely 
available software which can be used to determine coding potential (and frame) of a given 
stretch of DNA. The algorithm-derived information combined with start/stop codon 
information can be used to determine proper frame with a high degree of certainty. This, in 
3 5 turn, readily permits cloning of the sequence in the correct reading frame into appropriate 
expression vectors. 
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The nucleic acid sequences disclosed herein may be joined to a variety of other 
polynucleotide sequences and vectors of interest by means of well established recombinant 
DNA techniques. See J. Sambrook et al, supra. Vectors of interest include cloning vectors, 
such as plasmids, cosmids, phage derivatives, phagemids, as well as sequencing, replication, 
5 and expression vectors, and the like. In general, such vectors contain an origin of replication 
functional in at least one organism, convenient restriction endonuclease digestion sites, and 
selectable markers appropriate for particular host cells. The vectors can be transferred by a 
variety of means known to those of skill in the art into suitable host cells which then produce 
the desired DNA, RNA or polypeptides. 
1 0 Occasionally, sequencing or random reverse transcription errors will mask the presence 

of the appropriate open reading frame or regulatory element. In such cases, it is possible to 
determine the correct reading frame by attempting to express the polypeptide and determining 
the amino acid sequence by standard peptide mapping and sequencing techniques. See, F. M. 
Ausubel, et al., Current Protocols in Molecular Biology . John Wiley & Sons, New York, NY 

1 5 (1989). Additionally, the actual reading frame of a given nucleotide sequence may be 

determined by transfection of host cells with vectors containing all three potential reading 
frames. Only those cells with the nucleotide sequence in the correct reading frame will 
produce a peptide of the predicted length. 

The nucleotide sequences provided herein have been prepared by current, state-of-the- 

2 0 art, automated methods and as such may contain unidentified nucleotides. These will not 

present a problem to those skilled in the art who wish to practice the invention. Several 
methods employing standard recombinant techniques, described in J. Sambrook ( supra) or 
periodic updates thereof, may be used to complete the missing sequence information. The 
same techniques used for obtaining a full length sequence, as described herein, may be used to 

2 5 obtain nucleotide sequence. 

Expression of a particular cDNA may be accomplished by subcloning the cDNA into 
an appropriate expression vector and transfecting this vector into an appropriate expression 
host. The cloning vector used for the generation of the cDNA library can be used for 
transcribing mRNA of a particular cDNA and contains a promoter for beta-galactosidase, an 

3 0 amino-terminal met and the subsequent seven amino acid residues of beta-galactosidase. 

Immediately following these eight residues is an engineered bacteriophage promoter useful for 
artificial priming and transcription and a number of unique restriction sites, including EcoR I, 
for cloning. The vector can be transfected into an appropriate host strain of E. coli . 

Induction of the isolated bacterial strain with isopropylthiogalactoside (IPTG) using 
3 5 standard methods will produce a fusion protein which contains the first seven residues of beta- 
galactosidase, about 15 residues of linker, and the peptide encoded within the cDNA. Since 
cDNA clone inserts are generated by an essentially random process, there is one chance in 



WO 98/40475 PCT/US98/04694 



38 

three that the included cDNA will lie in the correct frame for proper translation. If the cDNA is 
not in the proper reading frame, the correct frame can be obtained by deletion or insertion of an 
appropriate number of bases by well known methods including in vitro mutagenesis, digestion 
with exonuclease HI or mung bean nuclease, or oligonucleotide linker inclusion. 
5 The cDNA can be shuttled into other vectors known to be useful for expression of 

protein in specific hosts. Oligonucleotide primers containing cloning sites and segments of 
DNA sufficient to hybridize to stretches at both ends of the target cDNA can be synthesized 
chemically by standard methods. These primers can then be used to amplify the desired gene 
segments by PCR. The resulting new gene segments can be digested with appropriate 
1 0 restriction enzymes under standard conditions and isolated by gel electrophoresis. Alternately, 
similar gene segments can be produced by digestion of the cDNA with appropriate restriction 
enzymes and filling in the missing gene segments with chemically synthesized 
oligonucleotides. Segments of the coding sequence from more than one gene can be ligated 
together and cloned in appropriate vectors to optimize expression of recombinant sequence. 

1 5 Suitable expression hosts for such chimeric molecules include but are not limited to, 

mammalian cells such as Chinese Hamster Ovary (CHO) and human 293 cells, insect cells 
such as Sf9 cells, yeast cells such as Saccharomvces cerevisiae> and bacteria such as E. coli . 
For each of these cell systems, a useful expression vector may also include an origin of 
replication to allow propagation in bacteria and a selectable marker such as the beta-lactamase 

2 0 antibiotic resistance gene to allow selection in bacteria. In addition, the vectors may include a 

second selectable marker such as the neomycin phosphotransferase gene to allow selection in 
transfected eukaryotic host cells. Vectors for use in eukaryotic expression hosts may require 
the addition of 3' poly A tail if the sequence of interest lacks poly A. 

Additionally, the vector may contain promoters or enhancers which increase gene 

2 5 expression. Such promoters are host specific and include but are not limited to MMTV, 

SV40, or metallothionine promoters for CHO cells; trp, lac, tac or T7 promoters for bacterial 
hosts; or alpha factor, alcohol oxidase or PGH promoters for yeast. Adenoviral vectors with 
or without transcription enhancers, such as the rous sarcoma virus (RSV) enhancer, may be 
used to drive protein expression in mammalian cell lines. Once homogeneous cultures of 

3 0 recombinant cells are obtained, large quantities of recombinantly produced protein can be 

recovered from the conditioned medium and analyzed using chromatographic methods well 
known in the art. An alternative method for the production of large amounts of secreted 
protein involves the transformation of mammalian embryos and the recovery of the 
recombinant protein from milk produced by transgenic cows, goats, sheep, etc. Polypeptides 
3 5 and closely related molecules may be expressed recombinantly in such a way as to facilitate 
protein purification. One approach involves expression of a chimeric protein which includes 
one or more additional polypeptide domains not naturally present on human polypeptides. 
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Such purification-facilitating domains include, but are not limited to, metal-chelating peptides 
such as histidine-tryptophan domains that allow purification on immobilized metals, protein A 
domains that allow purification on immobilized immunoglobulin, and the domain utilized in the 
FLAGS extension/affinity purification system (Immunex Corp, Seattle, WA). The inclusion 
5 of a cleavable linker sequence such as Factor XA or enterokinase from Invitrogen (San Diego, 
CA) between the polypeptide sequence and the purification domain may be useful for 
recovering the polypeptide. 
Immunoassays. 

The polypeptides including their fragments or derivatives or analogs thereof of the 
1 0 present invention, or cells expressing them, can be in a variety of assays, many of which are 
described herein, for the detection of antibodies. They also can be used as an immunogen to 
produce antibodies. These antibodies can be, for example, polyclonal or monoclonal 
antibodies, chimeric, single chain and humanized antibodies, as well as Fab fragments, or the 
product of an Fab expression library. Various procedures known in the art may be used for 

1 5 the production of such antibodies and fragments. 

For example, antibodies generated against a polypeptide corresponding to a sequence 
of the present invention can be obtained by direct injection of the polypeptide into an animal or 
by administering the polypeptide to an animal such as a mouse, rabbit, goat or human. A 
mouse, rabbit or goat is preferred. The antibody so obtained then will bind the polypeptide 

2 0 itself. In this manner, even a sequence encoding only a fragment of the polypeptide can be 

used to generate antibodies that bind the native polypeptide. Such antibodies can then be used 
to isolate the polypeptide from test samples such as tissue suspected of containing that 
polypeptide. For preparation of monoclonal antibodies, any technique which provides 
antibodies produced by continuous cell line cultures can be used. Examples include the 

2 5 hybridoma technique as described by Kohler and Milstein, Nature 256:495-497 (1975), the 

trioma technique, the human B-cell hybridoma technique as described by Kozbor et al, 
Immun. Today 4:72 (1983), and the EBV-hybridoma technique to produce human monoclonal 
antibodies as described by Cole, et al., in Monoclonal Antibodies and Cancer Therapy . Alan 
R. Liss, Inc, New York, NY, pp. 77-96 (1985). Techniques described for the production of 

3 0 single chain antibodies can be adapted to produce single chain antibodies to immunogenic 

polypeptide products of this invention. See, for example, U.S. Pat. No. 4,946,778, which is 
incorporated herein by reference. 

1 Various assay formats may utilize the antibodies of the present invention, including 
"sandwich" immunoassays and probe assays. For example, the monoclonal antibodies or 
3 5 fragment thereof of the present invention can be employed in various assay systems to 

determine the presence, if any, of MMP19 polypeptide in a test sample. For example, in a first 
assay format, a polyclonal or monoclonal antibody or fragment thereof, or a combination of 
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these antibodies, which has been coated on a solid phase, is contacted with a test sample, to 
form a first mixture. This first mixture is incubated for a time and under conditions sufficient 
to form antigen/antibody complexes. Then, an indicator reagent comprising a monoclonal or a 
polyclonal antibody or a fragment thereof, or a combination of these antibodies, to which a 
5 signal generating compound has been attached, is contacted with the antigen/antibody 

complexes to form a second mixture. This second mixture then is incubated for a time and 
under conditions sufficient to form antibody/antigen/antibody complexes. The presence of a 
MMP19 polypeptide antigen present in the test sample and captured on the solid phase, if any, 
is determined by detecting the measurable signal generated by the signal generating compound. 
1 0 The amount of MMP1 9 polypeptide antigen present in the test sample is proportional to the 
signal generated. 

Or, a polyclonal or monoclonal MMP19 polypeptide antibody or fragment thereof, or a 
combination of these antibodies which is bound to a solid support, the test sample and an 
indicator reagent comprising a monoclonal or polyclonal antibody or fragments thereof, which 

1 5 specifically binds to MMP19 polypeptide antigen, or a combination of these antibodies to 

which a signal generating compound is attached, are contacted to form a mixture. This mixture 
is incubated for a time and under conditions sufficient to form antibody/antigen/antibody 
complexes. The presence, if any, of MMP19 polypeptide present in the test sample and 
captured on the solid phase is determined by detecting the measurable signal generated by the 

2 0 signal generating compound. The amount of MMP19 polypeptide proteins present in the test 

sample is proportional to the signal generated. 

In another assay format, one or a combination of at least two monoclonal antibodies of 
the invention can be employed as a competitive probe for the detection of antibodies to MMP19 
polypeptide protein. For example, MMP19 polypeptide proteins such as the recombinant 

2 5 antigens disclosed herein, either alone or in combination, are coated on a solid phase. A test 

sample suspected of containing antibody to MMP19 polypeptide antigen then is incubated with 
an indicator reagent comprising a signal generating compound and at least one monoclonal 
antibody of the invention for a time and under conditions sufficient to form antigen/antibody 
complexes of either the test sample and indicator reagent bound to the solid phase or the 

3 0 indicator reagent bound to the solid phase. The reduction in binding of the monoclonal 

antibody to the solid phase can be quantitatively measured. 

In yet another detection method, each of the monoclonal or polyclonal antibodies of the 
present invention can be employed in the detection of MMP 19 polypeptide antigens in fixed 
tissue sections, as well as fixed cells by immunohistochemical analysis. Cytochemical 
3 5 analysis wherein these antibodies are labeled directly (with, for example, fluorescein, colloidal 
gold, horseradish peroxidase, alkaline phosphatase, etc.) or are labeled by using secondary 
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labeled anti-species antibodies (with various labels as exemplified herein) to track the 
histopathology of disease also are within the scope of the present invention. 

In addition, these monoclonal antibodies can be bound to matrices similar to CNBr- 
activated Sepharose and used for the affinity purification of specific MMP19 polypeptide 
5 proteins from cell cultures or biological tissues such as to purify recombinant and native 
MMP19 polypeptide antigens and proteins. 

The monoclonal antibodies of the invention can also be used for the generation of 
chimeric antibodies for therapeutic use, or other similar applications. 

The monoclonal antibodies or fragments thereof can be provided individually to detect 
1 0- MMP19 polypeptide antigens. Combinations of the monoclonal antibodies (and fragments 

thereof) provided herein also may be used together as components in a mixture or "cocktail" of 
at least one MMP19 polypeptide antibody of the invention with antibodies to other MMP19 
polypeptide regions, each having different binding specificities. Thus, this cocktail can 
include the monoclonal antibodies of the invention which are directed to MMP19 polypeptide 

1 5 proteins and other monoclonal antibodies to other antigenic determinants of the MMP19 

polypeptide genome. 

The polyclonal antibody or fragment thereof which can be used in the assay formats 
should specifically bind to a MMP19 polypeptide region or other MMP19 polypeptide proteins 
used in the assay. The polyclonal antibody used preferably is of mammalian origin; human, 

2 0 goat, rabbit or sheep anti- MMP19 polypeptide polyclonal antibody can be used. Most 

preferably, the polyclonal antibody is rabbit polyclonal anti-MMP19 polypeptide antibody. 
The polyclonal antibodies used in the assays can be used either alone or as a cocktail of 
polyclonal antibodies. Since the cocktails used in the assay formats are comprised of either 
monoclonal antibodies or polyclonal antibodies having different MMP19 polypeptide 

2 5 specificity, they would be useful for diagnosis, evaluation and prognosis of MMP1 9 

polypeptide condition, as well as for studying MMP19 polypeptide protein differentiation and 
specificity. 

It is contemplated and within the scope of the present invention that the MMP19 
polypeptide may be detectable in assays by use of a recombinant antigen as well as by use of a 

3 0 synthetic peptide or purified peptide, which contains an amino acid sequence of MMP1 9 

polypeptide. It also is within the scope of the present invention that different synthetic, 
recombinant or purified peptides identifying different epitopes of the MMP19 polypeptide can 
be used in combination in an assay to diagnose, evaluate, or prognose the disease condition. 
In this case, these peptides can be coated onto one solid phase, or each separate peptide may be 
3 5 coated on separate solid phases, such as microparticles, and then combined to form a mixture 
of peptides which can be later used in assays. Furthermore, it is contemplated that multiple 
peptides which define epitopes from different polypeptides may be used in combination to 
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make a diagnosis, evaluation, or prognosis of disease. Peptides coated on solid phases or. 
labelled with detectable lables are then allowed to compete with peptides from a patient sample 
for a limited amount of antibody. A reduction in binding of the synthetic, recombinant, or 
purified peptides to the antibody (or antibodies) is an indication of the presence of MMP19 
5 polypeptides in the patient sample which in turn indicates the presence of disease in the patient. 
Such variations of assay formats are known to those of ordinary skill in the art and are 
discussed herein below. 

In another assay format, the presence of antibody and/or antigen to MMP19 
polypeptide can be detected in a simultaneous assay, as follows. A test sample is 
1 0 simultaneously contacted with a capture reagent of a first analyte, wherein said capture reagent 
comprises a first binding member specific for a first analyte attached to a solid phase and a 
capture reagent for a second analyte, wherein said capture reagent comprises a first binding 
member for a second analyte attached to a second solid phase, to thereby form a mixture. This 
mixture is incubated for a time and under conditions sufficient to form capture reagent/first 

1 5 analyte and capture reagent/second analyte complexes. These so-formed complexes then are 

contacted with an indicator reagent comprising a member of a binding pair specific for the first 
analyte labeled with a signal generating compound and an indicator reagent comprising a 
member of a binding pair specific for the second analyte labeled with a signal generating 
compound to form a second mixture. This second mixture is incubated for a time and under 

2 0 conditions sufficient to form capture reagent/first analyte/indicator reagent complexes and 

capture reagent/second analyte/indicator reagent complexes. The presence of one or more 
analytes is determined by detecting a signal generated in connection with the complexes formed 
on either or both solid phases as an indication of the presence of one or more analytes in the 
test sample. In this assay format, recombinant antigens derived from human expression 

2 5 systems may be utilized as well as monoclonal antibodies produced from the proteins derived 

from the mammalian expression systems as disclosed herein. Such assay systems are 
described in greater detail in EP Publication No. 0473065. 

In yet other assay formats, the polypeptides disclosed herein may be utilized to detect 
the presence of anti- MMP19 polypeptide in test samples. For example, a test sample is 

3 0 incubated with a solid phase to which at least one recombinant protein has been attached. 

These are reacted for a time and under conditions sufficient to form antigen/antibody 
complexes. Following incubation, the antigen/antibody complex is detected. Indicator 
reagents may be used to facilitate detection, depending upon the assay system chosen. In 
another assay format, a test sample is contacted with a solid phase to which a recombinant 
3 5 protein produced as described herein is attached and also is contacted with a monoclonal or 
polyclonal antibody specific for the protein, which preferably has been labeled with an 
indicator reagent. After incubation for a time and under conditions sufficient for 
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antibody /antigen complexes to form, the solid phase is separated from the free phase, and the 
label is detected in either the solid or free phase as an indication of the presence of MMP19 
polypeptide antibody. Other assay formats utilizing the recombinant antigens disclosed herein 
are contemplated. These include contacting a test sample with a solid phase to which at least 
5 one antigen from a first source has been attached, incubating the solid phase and test sample 
for a time and under conditions sufficient to form antigen/antibody complexes, and then 
contacting the solid phase with a labeled antigen, which antigen is derived from a second 
source different from the first source. For example, a recombinant protein derived from a first 
source such as E. coli is used as a capture antigen on a solid phase, a test sample is added to 
1 0 the so-prepared solid phase, and a recombinant protein derived from a different source (i.e., 
non- E. colfl is utilized as a part of an indicator reagent. Likewise, combinations of a 
recombinant antigen on a solid phase and synthetic peptide in the indicator phase also are 
possible. Any assay format which utilizes an antigen specific for MMP19 polypeptide from a 
first source as the capture antigen and an antigen specific for MMP19 polypeptide from a 

1 5 different second source are contemplated. Thus, various combinations of recombinant 

antigens, as well as the use of synthetic peptides, purified proteins, and the like, are within the 
scope of this invention. Assays such as this and others are described in U.S. Patent No. 
5,254,458, which enjoys common ownership and is incorporated herein by reference. 

Other embodiments which utilize various other solid phases also are contemplated and 

2 0 are within the scope of this invention. For example, ion capture procedures for immobilizing 

an immobilizable reaction complex with a negatively charged polymer (described in EP 
publication 0326100 and EP publication No. 0406473), can be employed according to the 
present invention to effect a fast solution-phase immunochemical reaction. An immobilizable 
immune complex is separated from the rest of the reaction mixture by ionic interactions 

2 5 between the negatively charged poly-anion/immune complex and the previously treated, 

positively charged porous matrix and detected by using various signal generating systems 
previously described, including those described in chemiluminescent signal measurements as 
described in EPO Publication No. 0 273,1 15. 

Also, the methods of the present invention can be adapted for use in systems which 

3 0 utilize microparticle technology including in automated and semi-automated systems wherein 

the solid phase comprises a microparticle (magnetic or non-magnetic). Such systems include 
those described in published EPO applications Nos. EP 0 425 633 and EP 0 424 634, 
respectively. 

The use of scanning probe microscopy (SPM) for immunoassays also is a technology 
3 5 to which the monoclonal antibodies of the present invention are easily adaptable. In scanning 
probe microscopy, in particular in atomic force microscopy, the capture phase, for example, at 
least one of the monoclonal antibodies of the invention, is adhered to a solid phase and a 
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scanning probe microscope is utilized to detect antigen/antibody complexes which may be 
present on the surface of the solid phase. The use of scanning tunneling microscopy 
eliminates the need for labels which normally must be utilized in many immunoassay systems 
to detect antigen/antibody complexes. The use of SPM to monitor specific binding reactions 
5 can occur in many ways. In one embodiment, one member of a specific binding partner 

(analyte specific substance which is the monoclonal antibody of the invention) is attached to a 
surface suitable for scanning. The attachment of the analyte specific substance may be by 
adsorption to a test piece which comprises a solid phase of a plastic or metal surface, following 
methods known to those of ordinary skill in the art. Or, covalent attachment of a specific 
1 0 binding partner (analyte specific substance) to a test piece which test piece comprises a solid 
phase of deri vatized plastic, metal, silicon, or glass may be utilized. Covalent attachment 
methods are known to those skilled in the art and include a variety of means to irreversibly link 
specific binding partners to the test piece. If the test piece is silicon or glass, the surface must 
be activated prior to attaching the specific binding partner. Also, polyelectrolyte interactions 

1 5 may be used to immobilize a specific binding partner on a surface of a test piece by using 

techniques and chemistries. The preferred method of attachment is by covalent means. 
Following attachment of a specific binding member, the surface may be further treated with 
materials such as serum, proteins, or other blocking agents to minimize non-specific binding. 
The surface also may be scanned either at the site of manufacture or point of use to verify its 

2 0 suitability for assay purposes. The scanning process is not anticipated to alter the specific 

binding properties of the test piece. 

While the present invention discloses the preference for the use of solid phases, it is 
contemplated that the reagents such as antibodies, proteins and peptides of the present 
invention can be utilized in non-solid phase assay systems. These assay systems are known to 

2 5 those skilled in the art, and are considered to be within the scope of the present invention. 

It is contemplated that the reagent employed for the assay can be provided in the form 
of a test kit with one or more containers such as vials or bottles, with each container containing 
a separate reagent such as a probe, primer, monoclonal antibody or a cocktail of monoclonal 
antibodies, or a polypeptide (either recombinant or synthetic) employed in the assay. Other 

3 0 components such as buffers, controls, and the like, known to those of ordinary skill in art, 

may be included in such test kits. It also is contemplated to provide test kits which have means 
for collecting test samples comprising accessible body fluids, eg. blood, urine, saliva, and 
stool. Such collection means include lancets and absorbent paper or cloth for collecting and 
stabilizing blood; swabs for collecting and stabilizing saliva; cups for collecting and stabilizing 
3 5 urine or stool samples. Collection materials, papers, cloths, swabs, cups and the like, may 
optionally be treated to avoid denaturation or irreversible adsorption of the sample. The 
collection materials also may be treated with or contain preservatives, stabilizers or 
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antimicrobial agents to help maintain the integrity of the specimens. Test kits designed for the 
collection, stabilization, and preservation of test specimens obtained by surgery or needle 
biopsy are also useful. It is contemplated that all kits may be configured in two components; 
one component for collection and transport of the specimen, and the other component for the 
5 analysis of the specimen. Further, kits for the collection, stabilization, and preservation of test 
specimens may be configured for use by untrained personnel and may be available in the open 
market for use at home with subsequent transportation to a laboratory for analysis of the test 
sample. 

E. coli bacteria (clone #907334) has been deposited at the American Type Culture 
1 0 Collection (A.T.C.C.), 12301 Parklawn Drive, Rockville, Maryland 20852, as of March 10, 
1997, under the terms of the Budapest Treaty and will be maintained for a period of thirty (30) 
years from the date of deposit, or for five (5) years after the last request for the deposit, or for 
the enforceable period of the U.S. patent, whichever is longer. The deposit and any other 
deposited material described herein are provided for convenience only, and are not required to 

1 5 practice the present invention in view of the teachings provided herein. The cDNA sequence in 

all of the deposited material is incorporated herein by reference. Clone #907334 was accorded 

A.T.C.C. Deposit No . 

Having now generally described the invention, a complete understanding can be 
obtained by reference to the following specific examples. The following examples are given 

2 0 for the purpose of illustrating various embodiments of the invention and are not meant to limit 

the present invention in any fashion. 

Exam ple 1 : Isolation of Human cDNA Clones Homologous to MMPs 

2 5 A consensus of twelve human matrix metal loprotease amino acid sequences was 

obtained by aligning sequences using the PILEUP program in the Wisconsin Sequence 
Analysis Package (Genetics Computer Group [GCG], Madison, WI). The consensus 
sequence was derived from a plurality of seven sequences at a given amino acid position. The 
letter "x" was substituted for each amino acid that differed from the determined consensus 

3 0 residue and was used to represent any amino acid. 

The MMP consensus sequence (SEQ ID NO:7, see FIG. 1) was used to search the 
LifeSeq™ human expression database (Incyte Pharmaceuticals, Inc., Palo Alto, CA) for 
human MMP sequences. This database is comprised of partial sequences of cDNA clone 
inserts, so-called expressed sequence tags (ESTs), derived from cDNA libraries from various 
3 5 human tissues. A search of LIFESEQ™ using the BLAST and Smith-Waterman similarity 

search algorithms with the MMP consensus sequence identified an EST (from clone #907334) 
unique to the LIFESEQ™ database and whose deduced amino acid sequence of one reading 
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frame contained the amino acid sequence PRCGVTD (SEQ ID NO:8), similar to the "cysteine 
switch" motif in the propeptide region of all MMPs. The EST from clone #907334 was 
derived from a colon tissue library using oligo-dT for the reverse transcription reaction. The 
cDNA clone #907334 in pSPORTl plasmid (GIBCO BRL, Gaithersburg, MD) was obtained 
5 for further study. 

Clone #907334 was amplified in bacteria and digested with the restriction enzymes 
EcoRI and NotI to determine the size of the cDNA insert. Insert length was determined to be 
approximately 1900 base pairs. The complete nucleotide sequence of the clone (shown in 
FIG. 4) was obtained by initially sequencing with plasmid primers and subsequently with 
1 0 primers near the end of each newly obtained sequence. All sequence information from each 
reaction was compiled and analyzed using the Sequencher™ program (Gene Codes 
Corporation, Ann Arbor, MI). Sequencing was continued until both strands of the cloned 
cDNA insert were sequenced in entirety. In FIG. 4, the sense strand (SEQ ID NO: 1) is shown 
on top with its complement (SEQ ID NO:2) aligned directly beneath. 

1 5 The deduced amino acid sequence of clone #907334 (SEQ ID NO: 10, also shown in 

FIG. 4) encoded a polypeptide of at least 470 amino acids but lacked an initiating methionine 
start codon and a portion of the putative pro-peptide domain. The amino acid sequence was 
compared to those of other MMPs (FIG. 5) and shown to contain a cysteine switch motif 
(SEQ ID NO:8) within the putative pro-peptide domain which was 86% homologous to the 

2 0 consensus cysteine switch motif of the other MMPs (PRCGVPD, SEQ ID NO: 11). In 

addition, MMP19 contained an identical version (100%) of the highly conserved zinc binding 
consensus sequence (HEXaaGHXaaLGLXaaHS, SEQ ID NO: 12) found in the putative 
catalytic domain. The sequence also contained a potential recognition and cleavage site 
(RXKR, SEQ ID NO:33) for Kex2-like proteases shown to be involved in enzyme activation 

2 5 of stromelysin 3 and the membrane-type MMPs. 

Example 2: Localization of Clone #907334 Transcripts 

A. Nothern Blotting : The entire nucleotide sequence of clone #907334 (hereafter 

3 0 referred to as MMP19) was compared to LIFESEQ™ and other databases (for example 

GenBank, Wash. U./Merck EST) using the BLAST and Smith-Waterman programs. Public 
databases did not contain sequences identical to MMP19 cDNA. LIFESEQ™ contained eight 
additional ESTs identical to some portion of MMP19 cDNA (FIG. 6). The nine ESTs in 
LIFESEQ™ which were identical to MMP19 DNA had been sequenced from cDNA libraries 
3 5 derived from 8 different tissues, including colon, lung, prostate, spinal cord, ovary and 

prostate tumor, brain tumor and ganglioneuroma (FIG. 6) Thus, these tissues expressed at 
least small amounts of the mRNA encoded by MMP19 cDNA. 
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In order to determine whether MMP19 mRNA sequences could be detected by 
hybridization, the entire 1717 bp fragment of MMP19 cDNA was radiolabeled with P using 
a commercial random primer labeling kit (Pharmacia, Piscataway, NJ). Specific activity of the 
labeled fragment was determined to be -5 x 10 cpm/^ig DNA. The labeled cDNA was used 
5 as a probe to hybridize to mRNA (2 |Xg polyA+ RNA) on Northern blots (specifically, 

Multiple Tissue Northerns 1 and 2, obtained from Clontech, Palo Alto, CA) which contained 
various human tissue RNA. The blots were prehybridized at 60°C for 1 hour in Express Hyb 
solution (supplied with the blots) and hybridized (also in Express Hyb solution) at the same 
temperature for two hours in the presence of denatured probe at 1 x 10** cpm/mL. After 
1 0 washing the blots once in 2X SSPE + 0.1% SDS (20 min), and twice under stringent 

conditions (0.2X SSPE + 0.1% SDS, 50°C, 20 min. each wash), the filters were exposed and 
analyzed on a Phosphorimager (Molecular Dynamics, Sunnyvale, CA). By Northern blotting, 
it was determined that MMP19 mRNA transcript is approximately 2.4 kb and is present in 
human testes and to a lesser extent in human colon, ovary, small intestine, and prostate tissue 

1 5 (FIG. 1 1). Smaller less abundant transcripts with similar tissue distribution were also 

observed. 

B. Ribonuclease Protection Assay : Alternatively, instead of or in addition to 
performing a Northern blot, a ribonuclease protection assay may be performed to determine 
whether MMP19 is present in particular tissues. A ribonuclease protection assay is performed 

2 0 as follows: 

1. Labeling of Complementary RNA (cRNA) Hybridiz ation Probes . Labeled 
sense and antisense riboprobes are transcribed from the EST sequence which contains an RNA 
polymerase promoter such as SP6 or T7. The sequence may be from a vector containing the 
appropriate EST insert or from a PCR-generated product of the insert using PCR primers 

2 5 which incorporate am RNA polymerase promoter sequence. The transcripts are prepared in a 

20 |iL reaction volume containing 1 |ig of DNA template, 2 jiL of 100 mM dithiothreitol, 0.8 
(XL of RNasin (10-40U), 500 yM each of ATP, CTP, GTP, 5 |UL (alpha 32 P) UTP or 100-500 
(iM biotinylated UTP, and 1 \lL of RNA polymerase in transcription buffer (40 mM Tris-HCl, 
pH 7.5, 6 mM MgCl2, 2 mM spermidine HC1, 5 mM NaCl). Following incubation at 37~C 

3 0 for one hour, the transcripts are treated with DNase I (15 U) for an additional 30 min to digest 

the template. The probes then are isolated by spin columns, salt precipitation or 
electrophoresis techniques which are well-known in the art. Finally, the probes are dissolved 
in lysis buffer (5 M Guanidine Thiocyanate, 0.1 M EDTA, pH 7.0). 

2. Hybridization of Labeled Probe to Target . Approximately 20 |Xg of 

3 5 extracted total cellular RNA, prepared as described in Sambrook, et al supra, is placed in 10 
|iL of lysis buffer and mixed with either (i) I x 1 0^ cpm of radioacti vely labeled probe or (ii) 
250 pg of non-isotopically labeled probe, each in 2 (iL of lysis buffer. The mixture then is 



WO 98/40475 PCT/US98/04694 



incubated at 60«>C for 5 min and hybridized overnight at room temperature. See, T. Kaabache 
et al, Anal Biochem. 232: 225-230 (1995). 

3. RNase Digestion . Hybridizations are terminated by incubation with 380 nL 
of a solution containing 40 jig/mL RNase A and 625 U/mL RNase Tl in 1 mM EDTA, 300 

5 mM NaCl, 30 mM Tris-HCI pH 7.4 for 45-60 min at room temperature. RNase digestion then 
is terminated by the addition of 60 pL of proteinase-K (1 .7 mg/mL) containing 3.3% SDS, 
followed by incubation for 30 min at 37°C. The digested mixture then is extracted with 
phenol :chloroform:isoamyl alcohol to remove protein. The mRNAxRNA hybrids are 
precipitated from the aqueous phase by the addition 4 |Llg yeast tRNA and 800 |iL of ethanol, 

1 0 and incubation at -80°C for 30 min. The precipitates are collected by centrifugation. 

4. Fragment Analysis . The precipitates are dissolved in 5 |xL of denaturing gel 
loading dye (80% formamide, 10 mM EDTA, pH 8.0, 1 mg/mL xylene cyanol, 1 mg/mL 
bromophenol blue) and electrophoresed in 6 % polyacrylamide TBE, 8 M urea denaturing 
gels. The gels are dried under vacuum and autoradiographed. Quantitation can be performed 

15 by comparing the counts obtained from the test samples to a calibration curve that was 

generated by utilizing calibrators that are the sense strand. In cases where non-isotopic labels 
are used, hybrids are transferred from the gels to membranes (nylon or nitrocellulose) by 
blotting and then analyzed using detection systems that employ streptavidin alkaline 
phosphatase conjugates and chemiluminesence or chemifluoresence reagents. Again, 

2 0 expression of an mRNA which is detectable by the labeled probe in a particular tissue suggests 

that MMP19 is produced in that tissue. 

C. Dot Blot/Slot Blot : Dot and slot blot assays are also quick methods to evaluate the 
presence of a specific nucleic acid sequence in a complex mix of nucleic acid. To perform, up 
to 20 \ig of RNA is mixed in 50 ^L of 50% formamide, 7% formaldehyde, IX SSC, incubate 

2 5 1 5 min at 68°C and cool on ice. Then, 100 pL of 20X SSC is added to the RNA mixture and 

loaded under vacuum onto a manifold apparatus that has a prepared nitrocellulose or nylon 
membrane. The membrane is soaked in water, 20X SSC for 1 hour, placed on two sheets of 
20X SSC prewet Whatman #3 filter paper, and loaded into a slot blot or dot blot vacuum 
manifold apparatus. The slot blot is analyzed with probes prepared and labeled as described 

3 0 supra. Other methods and buffers not specifically detailed here are described in J. Sambrook 

et al, supra. 

D. In Situ Hybridization : This method is useful to directly detect specific target 
nucleic acid sequences in cells using detectable nucleic acid hybridization probes. 

Tissues are prepared with cross-linking fixatives agents such as paraformaldehyde or 
3 5 glutaraldehyde for maximum cellular RNA retention. See, L. Angerer et ai, Methods in Cell 
Biol 35: 37-71 (1991). Briefly, the tissue is placed in greater than 5 volumes of 1% 
glutaraldehyde in 50 mM sodium phosphate, pH 7.5 at 4°C for 30 min. The solution is 
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changed with fresh solution for a further 30 min fixing. The fixing solution should have an 
osmolality of approximately 0.375% NaCl. The tissue is washed once in isotonic NaCl to 
remove the phosphate. 

The fixed tissues then are embedded in paraffin, as follows. The tissue is dehydrated 
5 through a series of ethanol concentrations for 15 min each: 50% twice, 70% twice, 85%, 90% 
and 100% twice. The tissue next is soaked in two changes of xylene for 20 min each at room 
temperature; then it is soaked in two changes of 1 xylene: 1 paraffin for 20 min each at 60°C; 
and then it is soaked in three final changes in paraffin for 15 min each. 

The tissue next is cut in 5 |Lim sections using a standard microtome and placed on a 
1 0 slide previously treated with the tissue adhesive 3-aminopropyltriethoxysilane. 

Paraffin is removed from the tissue by two 10 min xylene soaks and rehydrated in a 
series of ethanol concentrations; 99% twice, 95%, 85%, 70%, 50%, 30% and distilled water 
twice. The sections are pre-treated with 0.2 M HC1 for 10 min and permeabilized with 2 
^ig/mL Proteinase-K at 37°C for 15 min. 

1 5 Labeled riboprobes transcribed from the pSPORTl plasmid containing fragments of 

MMP19 cDNA are hybridized to the prepared tissue sections and hybridized overnight at 
56°°C in 3X standard saline extract and 50% formamide. Excess probe is removed by 
washing in 2X standard saline citrate and 50% formamide followed by digestion with 100 
|lg/mL RNase A at 37°oC for 30 min. Fluorescence probe is visualized by illumination with 

2 0 UV light under a microscope. Fluorescence in the cytoplasm is indicative of mRNA 

production. Fluorescence in the nucleus detects the presence of genomic material. 
Alternatively, the sections can be visualized by autoradiography. 

Example 3: Expression of MMP19 Sequences 

25 

A. Construction of Expression Vectors Containing DNA Fragments Encoding the 
Catalytic Domains of MMP19 Protein : The MMP19 plasmid was used as template in PCR 
reactions to generate DNA fragments encoding MMP19 protein for introduction into two 
eukaryotic expression vectors, pcDNA3.1 (Invitrogen, San Diego, CA) and pCINeo 

3 0 (Promega, Madison, WI). The two upstream primers used in PCR reactions, SEQ ID NO:3 

(5 ' -GTATCTCT AG AC ACC ATGTTTGC AAAGC AAGGT AAC A AATGGT AC AAGC-3 ' ) and ' 
SEQ ID NO:4 (5 ' -GTATCTCTAG ACACC ATG AAGGTTCTGTGGGCTGCGTTGCTGGT- 
C AC ATTCCTGGCAGGATGCC AGGCCTTTGCA A AGC A AGGT A AC AA ATGG-3 ' ) 
contained sequences to create an Xbal restriction enzyme site, a Kozak consensus sequence for 
3 5 translation initiation, an ATG start codon, and a sequence of nucleotides from nucleotide 
position 214 to nucleotide position 244 of SEQ ID NO: l, which corresponded to the N- 
terminus of the putative catalytic domain of the enzyme. SEQ ID NO:4 also contained a 
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sequence which encodes a signal peptide (from the secreted protein, apolipoprotein E), for 
transport of the translated protein into the secretory pathway. The signal peptide sequence was 
placed upstream from and in frame with the sequence encoding the N-terminus of the putative 
catalytic domain. Downstream primers used in PCR reactions were either SEQ ID NO:5 (5'- 
5 GTACTTCTAGACTACTTGTC 

CTCCCCG AGTTGGC ATGCC-3 ' ) which encodes an epitope recognized by anti-FLAG M2, 
(available from Sigma Biosciences, St. Louis, MO) or SEQ ID NO:6 (S'-GTACTTCTAGA- 
GATCTTCITCACTGATCAGCTTCTGTTCACCACCGAACAGGG 
-GCATGCC-3') which encodes an epitope recognized by anti-c-Myc (Santa Cruz 
1 0 Biotechnology, Santa Cruz, CA). In both downstream primers, these epitope sequences were 
in frame with the last codon in the open reading frame of the cDNA and were followed by a 
stop codon (TGA) and an Xbal restriction enzyme site. 

PCR amplifications were performed using primers sets SEQ ID NOS:3 and 5, SEQ ID 
NOS:3 and 6, SEQ ID NOS:4 and 5 and SEQ ID NOS:4 and 6 under standard PCR 

1 5 conditions, i.e. in a total reaction volume of 50 )iL containing 200 ^iM of each dNTP wherein 

N was A, T, G and C, 1 jiM of each primer, -50 ng template DN A and 1 unit of AmpliTaq 
DNA polymerase (Perkin Elmer, Norwalk, CT). Amplifications were performed for a total of 
35 cycles (1 cycle = 94°C for 1 min, 60°C for 1 min, and 72°C for 2 min). After 
amplification, PCR products were digested with Xbal, gel purified, and ligated into the Xbal 

2 0 site of pCDNA3.1 and pCINeo. After transformation of the ligated DNA into appropriate 

bacterial hosts (for example DH5a), plasmid DNA was prepared from individual clones and 
subjected to restriction enzyme and sequence analysis to identify clones that contained MMP19 
DNA with the correct sequence and in the proper orientation. 

B. Transfection of MMP19 DNA Expression Vectors into HEK293 Cells: Expression 

2 5 of MMP19 constructs in HEK293 cells is achieved by transfection using a modified calcium- 

phosphate procedure (Chen and Okayama, Mol Cell Biol 7: 2745-2752, 1987). HEK-293 
cells (ATCC CRL1573) are grown in minimal essential medium (MEM) containing 10% fetal 
bovine serum (FBS) and antibiotic/antimycotic solution at 37° C in 5% CO2, 95% O2. Cells 
are transfected with purified plasmid MMP19 DNA (20 Jlg/2.5 X 10 6 cells/10 cm plate). For 

3 0 stable transfections, cells are grown in the presence of G41 8 and MEM/FBS. For both 

transient and stable transfections, serum-free media is harvested 3 days after medium 
replacement. 

C. Detection of MMP 19 protein (Western blot): Media from transfected cells is mixed 
(vohvol, 1:1) with 2X denaturing buffer (10 mM Tris-HCl, pH6.8, 4% SDS, 20% glycerol, 

3 5 1 % B-mercaptoethanol, 0.02% bromphenol blue) and denatured by heating 5 minutes, 95°C. 
Aliquots (20pJ) of each sample are electrophoresed for ~1 hour on a 10-20% Tris-tricine 
minigel (10X10 cm) at -50 mA constant current. Gels are transferred to PVDF membranes 
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by electroblotting at 200 mA for 1 .5 hours. Non-specific binding sites on PVDF membranes 
are blocked for 1 hour at room temperature with 5% nonfat dry milk in Tris-buffered saline 
containing Tween-20 (TBS-Tween) (20 mM Tris, pH 7.6, 140 mM NaCl, 0. 1 % Tween-20). 
Blots are then incubated for 1 hour at room temperature in TBS-T ween/5% milk containing an 
5 appropriate amount of primary antibody (either anti-FLAG or anti-c-Myc). Blots are washed 
in 3 changes of TBS-Tween for a total of 45 minutes at room temperature before incubating 
with the detection antibody (1:1000 dilution) for 1 hour at room temperature in TBS- 
Tween/5% milk. Blots are again washed in 3 changes of TBS-Tween for 45 minutes at room 
temperature. Protein bands are visualized by autoradiography using ECL Western blotting 
1 0 detection reagents (Amersham, Arlington Heights, IL). 

Example 4: Assays of Enzymatic Activity of MMP1 9 

Purification of the MMP1 9 protein containing the FLAG sequence is performed by 

1 5 immunoaffinity chromatography using an affinity matrix comprising anti-FLAG M2 

monoclonal antibody covalently attached to agarose by hydrazide linkage (Eastman Kodak 
Co., New Haven, CT). Prior to affinity purification, medium from transfected HEK293 cells 
is exchanged into 50 mM Tris-HCl pH 7.5, 150 mM NaCl buffer using a Sephadex G-25 
(Pharmacia Biotech Inc., Uppsala, Sweden) column. Protein in this buffer is applied to the 

2 0 anti-FLAG M2 antibody affinity column, non-binding protein is eluted by washing the column 

with 50 mM Tris-HCl pH 7.5, 150 mM NaCl buffer, and bound protein is eluted using an 
excess of FLAG peptide in 50 mM Tris-HCl, pH 7.5, 150 mM NaCl. The excess FLAG 
peptide is removed by size exclusion chromatography and column fractions containing 
MMP19 protein are dialyzed in 50 mM Tris-HCl, pH 7.5, 150 mM NaCl, 10 MM CaCl. 

2 5 Assays to determine enzyme activity are performed using a fluorogenic peptide 

substrate Gly-Glu(EDANS)-Gly-Pro-Leu-Gly-Leu-Tyr-Ala-Lys(DABCYL)-Gly (SEQ ID 
NO: 13). SEQ ID NO: 1 3 was synthesized and purified acording the the procedure of E.D. 
Matayoshi etal, Science, 247: 954-958, 1990). Hydrolysis of the Gly-Leu bond in SEQ ID 
NO: 13 results in a 40-fold increase in fluorescence. 

3 0 Purified MMP19 protein from transfected HEK293 cell medium is serially diluted and 

incubated in microtiter plate wells with 100 [XM fluorogenic peptide substrate, 50 mM Tris- 
HCl, pH 7.5, 1 50 mM NaCl in a final volume of 150 |iL. Assay mixtures are incubated at 
room temperature and the progress of the reaction monitored for up to 1 hour in an Titertek 
Fluoroskan II instrument (ICN Biomedicals, Huntsville, AL) with an excitation filter set at 335 
3 5 nm and emission filter at 485 nm. Background fluorescence is determined using identical 

reactions in the absence of MMP19. Data is collected online with a Macintosh computer using 
DELTA SOFT II, version 4.0 (BioMetallics, Inc., Princeton, NJ). Nonlinear curve fitting is 
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performed using KaleidaGraph (Synergy Software, Reading, PA). Compounds which 
potentially inhibit MMP19 activity are screened by including serial dilutions of each compound 
with the above reaction and comparing the MMP19 enzymatic activity in the presence and 
absence of compound. 

Example 5: Proteolysis of pro-MMPs by MMP19 

Interaction of MMP19 with other MMPs is determined by incubation of purified active 
MMP19 with the zymogen (pro-peptide) forms of Gelatinase-A (72 kD) [MMP-2] and 
Gelatinase-B (92 kD) [MMP-9]. Gelatinase-A and Gelatinase-B are isolated from HT-1080 
cell cultures stimulated with TNFa as described in J. Antibiotics 45: 1733-1737, 1992 and 
Biochem. 7. 285: 603-61 1, 1992. After 1 hour incubation at 37°C in buffer containing 50 mM 
Tris-HCl, pH 7.5, 150 mM NaCl and approximately 250 ng of each enzyme, aliquots are 
removed and electrophoresed on 10-20% Tris-Tricine gels. Western blots of gels are 
performed using antibodies to Gelatinse A and Gelatinase B (Oncogene Science, Cambridge, 
MA) as described above and proteolysis is indicated by the presence of bands of lower 
molecular weight relative to the pro-enzyme forms of Gelatinase A and Gelatinase B. 

F.xam ple 6: Productio n of Polyclonal Antibodies to MMP19 

Synthetic peptides (SEQUENCE ID NOS: 14-17, see Table 1 below) are prepared 
based upon the predicted amino acid sequence of the MMP1 9 polypeptide. 



Table 1 




Sequence 


SEO ID NO: 


RHRTKMRRKKRFAKQGN 


14 


FQGDHNDGLGNAFDG 


15 


RSLQDWGGIPEEVSGALPRPDGSII 


16 


ATELPWMGCWHANSGSALF 


17 



Peptides are synthesized on an ABI Peptide Synthesizer (Applied Biosystems, Foster 
City, CA), using standard reagents and conditions known in the art for solid phase peptide 
synthesis (see for example, Stewart, J.M., and Young, D.J., Solid Phase Peptide Synthesis, 
W.H. Freeman Co., San Francisco, 1963). Cleavage of the peptide from the resin and final 
deprotection of the peptide are achieved by adding the resin to 20 ml trifluoroacetic acid (TFA), 
0.3 ml water, 0.2 ml ethanedithiol, 0.2 ml thioanisole and 100 mg phenol, and stirring at room 
temperature for 1 .5 hours. The resin then is filtered by suction and the peptide obtained by 
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precipitation of the TFA solution with ether, followed by filtration. Each peptide is purified 
via reverse-phase preparative HPLC using a water/acetonitrile/O. 1 % TFA gradient and 
lyophilized. The product is confirmed by mass spectrometry. 

To generate antigens for immunization, the purified peptides are conjugated to Keyhole 
5 Limpet Hemocyanin (KLH) and bovine serum albumin (BSA) using an Imject Activated 

Immunogen Conjugation Kit (Pierce, Rockford, DL) in accordance with the manufacturers 
instructions. 

Polyclonal antisera are generated using the protocol of the Berkeley Antibody 
Company (Berkeley, CA). Before receiving the first immunization, a sample of preimmune 
1 0 blood (5 ml) is drawn from each of at least 2 rabbits. Afterward, each rabbit is injected 
subcutaneously with an aliquot of KLH-conjugated peptide (200-500 mg) in Complete 
Freunds Adjuvant. After 21 days, the immune response is boosted with a second 
subcutaneous injection of KLH-conjugated peptide (100-250 mg) in Incomplete Freund's 
Adjuvant. Blood (50 ml) is collected on day 31 and serum tested for reactivity to BSA- 

1 5 coupled peptide using an enzyme linked immunoadsorbant assay (ELISA). Subsequent boosts *, 

with KLH-conjugated peptide are given on days 42, 63 and 84 (post injection #1) and 
production bleeds (50 ml) drawn on days 52, 73 and 94 for testing by ELISA to determine 
antibody titer. Serum is then stored at -20°C until further use. 

2 0 Exa mple 7: Isolation of the 5' end of MMP19 cDNA 

CapFinder cDNA synthesis kit (Clontech, Palo Alto, CA) was used to amplify DNA at 
the 5' end of MMP19 cDNA. Briefly, reverse transcription reactions using human testes 
mRNA (Clontech, Palo Alto, CA) were performed as specified by the manufacturer. The 

2 5 cDNA was amplified by PCR using an oligonucleotide primer complementary to MMP1 9 

cDNA (nucleotides 704-678 of SEQ ID NO:l) and the 5' CapSwitch oligonucleotide. PCR 
amplifications were performed in a total reaction volume of 100 |xL containing 200 |iM of each 
dNTP wherein N was A, T, G and C, 1 |iM of each primer, 20 |iL of cDNA from the reverse 
transcription reaction, and 1 unit of AmpliTaq DNA polymerase (Perkin Elmer, Norwalk, 

3 0 CT). Amplifications were performed for a total of 35 cycles ( 1 cycle = 94°C for 1 min, 60°C 

for 1 min, and 72°C for 2 min). Two PCR reaction products (1 .3 kb and 1 .0 kb) identified by 
agarose gel electrophoresis and ethidium bromide staining were large enough to contain 
sequences upstream of the 5' end of the original sequence. These fragments were purified and 
subcloned into pCR-Script (Stratagene, La Jolla, CA) in accordance with the protocol provided 
3 5 by the supplier. 

Clones are analyzed for sequences encoding the putative 5' end of MMP19 mRNA 
using an oligonucleotide complementary to MMP19 cDNA (nucleotides 704-678 of SEQ ID 
NO:l) in standard nucleotide sequencing reactions. Clones are verified as authentic MMP19 
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cDNA by PCR amplification of predicted size bands using testes mRNA as template and 
primers within both the new sequence and complementary to the original MMP19 cDNA 
sequence (nucleotides 655-626 of SEQ ID NO: 1). 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT: Falduto, Michael 

Magnuson, Scott R. 
Morgan, Douglas W. Morgan 

<ii) TITLE OF THE INVENTION: HUMAN MATRIX METALLOPROTEASE , 
PROTEINS ENCODED THEREFROM AND USES THEREOF 

(iii) NUMBER OF SEQUENCES: 33 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Abbott Laboratories 

(B) STREET: 100 Abbott Park Road 

(C) CITY: Abbott Park 

(D) STATE: IL 

(E) COUNTRY: USA 

(F) ZIP: 60064-3500 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: ll-MAR-1997 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 



(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Casuto, Dianne 

(B) REGISTRATION NUMBER: P-40,943 

(C) REFERENCE /DOCKET NUMBER: 6073. US. 01 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (847) 938-3137 

(B) TELEFAX: (847) 938-2623 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO : 1 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



fcAAGCTCCCA CCTCCACTCG ATTCAGCGAT GCCATCAGAG CGTTTCAGTG GGTGTCCCAG 
SSSS GCGGCGTGTT GGACCGCGCC AACCTGCGCC AGATGACTCG TCCCCGCTGC 
Sggttacag ATACCAACAG TTATGCGGCC TGGGCTGAGA GGATCAGTGA CTTGTTTGCT 
SaSSgga cSaaatgag GCGTAAGAAA CGCTTTGCAA AGCAAGGTAA CAAATGGTAC 
SSgScc JSStaccg CCTGGTGAAC TGGCCTGAGC ATCTGCCGGA GCCGGCAGTT 
ScScG TCCGCGCCGC CTTCCAGTTG TGGAGCAACG TCTCAGCGCT GGAGTTCTGG 
SgScag cScaggccc CGCTGACATC CGGCTCACCT TCTTCCAAGG GGACCACAAC 
StgSSgg gSSgcctt TGATGGCCCA GGGGGCGCCC TGGCGCACGC CTTCCTGCCC 
ScSSS SSacn CGACCAAGAT GAGCGCTGGT CCCTGAGCCG CCGCCGCGGG 
CGcScSS TCGTGGTGCT GGCGCACGAG ATCGGTCACA CGCTTGGCCT CACCCACTCG 
CCCGCGCCGC GCGCGCTCAT GGCGCCCTAC TACAAGAGGC TGGGCCGCGA CGCGCTGCTC 
aSgggS ACGTGCTGGC CGTGCAGAGC CTGTATGGGA AGCCCCTAGG GGGCTCAGTG 
ScSgc SSggaaa GCTGTTCACT GACTTTGAGA CCTGGGACTC CTACAGCCCC 
SSSg? GCCCTGAAAC GCAGGGCCCT AAATACTGCC actcttcctt cgatgccatc 
Stgtagaca ggcaacagca actgtacatt tttaaaggga gccatttctg ggaggtggca 
gSgatggca acgtctcaga gccccgtcca ctgcaggaaa gatgggtcgg gctgcccccc 
aacattgagg ctgcggcagt gtcattgaat gatggagatt tctacttctt caaagggggt 
cgatgctgga ggttccgggg ccccaagcca gtgtggggtc tcccacagct gtgccgggca 

GGGGGCCTGC CCCGCCATCC TGACGCCGCC CTCTTCTTCC CTCCTCTGCG CCGCCTCATC 
CTCTTCAAGG GTGCCCGCTA CTACGTGCTG GCCCGAGGGG GACTGCAAGT GGAGCCCTAC 
ScCCCCGAA GTCTGCAGGA CTGGGGAGGC ATCCCTGAGG AGGTCAGCGG CGCCCTGCCG 
AGGCCCGATG GCTCCATCAT CTTCTrCCGA GATGACCGCT ACTGGCGCCT CGACCAGGCC 1320 
SSSgG CAACCACCTC GGGCCGCTGG GCCACCGAGC TGCCCTGGAT GGGCTGCTGG 1380 
CATGCCAACT CGGGGAGCGC CCTGTTCTGA AGGCACCTCC TCACCTCAGA AACTGGTGGT 
GCTCTCAGGG CAAAATCATG TTCCCCACCC CCGGGGCAGA ACCCCTCTTA GAAGCCTCTG 

aSccctctg CAGAAGACCG GGCAGCAAAG cctccatctg gaagtctgtc tgcctttgtt 

CCTTGAAGAA TGCAGCATTG TCTTTGTCTG TCCCCACCAC ATGGAGGTGG GGGTGGGATC 
AATCTTAGGA AAAGCAAAAA AGGGTCCCAG ATCCCTTGGC CCTTTCCTCC gaaaaaaaaa 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAA 

(2) INFORMATION FOR SEQ ID NO:2: 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 

960 
1020 
1080 
1140 
1200 
1260 



1380 
1440 
1500 
1560 
1620 
1680 
1717 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: 

TTTCGAGGGT GGAGGTGAGC TAAGTCGCTA 
GATGGACAGT CGCCGCACAA CCTGGCGCGG 
CCCCAATGTC TATGGTTGTC AATACGCCGG 



SEQ ID NO: 2: 

CGGTAGTCTC GCAAAGTCAC CCACAGGGTC 
TTGGACGCGG TCTACTGAGC AGGGGCGACG 
ACCCGACTCT CCTAGTCACT GAACAAACGA 
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240 
300 
360 
420 
480 
540 
600 
660 
720 
7B0 
840 
900 
960 



TCTGTGGCCT GGTTTTACTC CGCATTCTTT GCGAAACGTT TCGTTCCATT GTTTACCATG 
TTCGTCGTGG AGAGGATGGC GGACCACTTG ACCGGACTCG TAGACGGCCT CGGCCGTCAA 
GCCCCGCGGC ACGCGCGGCG GAAGGTCAAC ACCTCGTTGC AGAGTCGCGA CCTCAAGACC 
CTCCGGGGTC GGTGTCCGGG GCGACTGTAG GCCGAGTGGA AGAAGGTTCC CCTGGTGTTG 
CTACCCGACC CGTTACGGAA ACTACCGGGT CCCCCGCGGG ACCGCGTGCG GAAGGACGGG 
GCGGCGCCGC TTCGCGTGAA GCTGGTTCTA CTCGCGACCA GGGACTCGGC GGCGGCGCCC 
GCGTTGGACA AGCACCACGA CCGCGTGCTC TAGCCAGTGT GCGAACCGGA GTGGGTGAGC 
GGGCGCGGCG CGCGCGAGTA CCGCGGGATG ATGTTCTCCG ACCCGGCGCT GCGCGACGAG 
TCGACCCTGC TGCACGACCG GCACGTCTCG GACATACCCT TCGGGGATCC CCCGAGTCAC 
CGGCAGGTCG AGGGTCCTTT CGACAAGTGA CTGAAACTCT GGACCCTGAG GATGTCGGGG 
GTTCCTTCCG CGGGACTTTG CGTCCCGGGA TTTATGACGG TGAGAAGGAA GCTACGGTAG 
TGACATCTGT CCGTTGTCGT TGACATGTAA AAATTTCCCT CGGTAAAGAC CCTCCACCGT 
CGACTACCGT TGCAGAGTCT CGGGGCAGGT GACGTCCTTT CTACCCAGCC CGACGGGGGG 

SgtaaScc gacgccgtca cagtaactta ctacctctaa AGATGAAGAA GTTTCCCCCA 1020 

GCTACGACCT CCAAGGCCCC GGGGTTCGGT CACACCCCAG AGGGTGTCGA CACGGCCCGT 1080 

rcCCCGGACG GGGCGGTAGG ACTGCGGCGG GAGAAGAAGG GAGGAGACGC GGCGGAGTAG 1140 

GAGAAGTTCC CACGGGCGAT GATGCACGAC CGGGCTCCCC CTGACGTTCA CCTCGGGATG 1200 

ATGGGGGCTT CAGACGTCCT GACCCCTCCG TAGGGACTCC TCCAGTCGCC GCGGGACGGC 1260 

TCCGGGCTAC CGAGGTAGTA GAAGAAGGCT CTACTGGCGA TGACCGCGGA GCTGGTCCGG 1320 

TTTGACGTCC GTTGGTGGAG CCCGGCGACC CGGTGGCTCG ACGGGACCTA CCCGACGACC 1380 

GTACGGTTGA GCCCCTCGCG GGACAAGACT TCCGTGGAGG AGTGGAGTCT TTGACCACCA 1440 

CGAGAGTCCC GTTTTAGTAC AAGGGGTGGG GGCCCCGTCT TGGGGAGAAT CTTCGGAGAC 1500 
TCAGGGAGAC GTCTTCTGGC CCGTCGTTTC GGAGGTAGAC CTTCAGACAG ACGGAAACAA 
GGAACTTCTT ACGTCGTAAC AGAAACAGAC AGGGGTGGTG TACCTCCACC CCCACCCTAG 
TTAGAATCCT TTTCGTTTTT TCCCAGGGTC TAGGGAACCG GGAAAGGAGG CTTTTTTTTT 
TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTT 



1500 
1560 
1620 
1680 
1717 



49 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GTATCTCTAG ACACCATGTT TGCAAAGCAA GGTAACAAAT GGTACAAGC 
(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

GTATCTCTAG ACACCATGAA GGTTCTGTGG GCTGCGTTGC TGGTCACATT CCTGGCAGGA 60 
TGCCAGGCCT TTGCAAAGCA AGGTAACAAA TGG 93 
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(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GTACTTCTAG ACTACTTGTC ATCGTCGTCC TTGTAGTCAC CACCGAACAG GGCGCTCCCC 60 
GAGTTGGCAT GCC 73 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

GTACTTCTAG AGATCTTCTT CACTGATCAG CTTCTGTTCA CCACCGAACA GGGCGCTCCC 60 
CGAGTTGGCA TGCC 74 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 820 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

15 10 15 

Xaa Xaa Xaa Leu Xaa Xaa. Leu Leu Leu Leu Xaa Xaa Xaa Xaa Xaa Xaa 

20 25 30 

Xaa Leu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

35 40 45 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Tyr Leu Xaa Xaa Tyr Tyr Xaa Leu Xaa 

50 55 60 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
65 70 75 80 

Xaa Xaa Xaa Leu Xaa Xaa Met Gin Xaa Phe Phe Gly Leu Xaa Val Thr 

85 90 95 

Gly Lys Leu Asp Xaa Xaa Thr Leu Glu Xaa Met Xaa Lys Pro Arg Cys 
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100 



105 



110 



Gly Val Pro Asp Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Val Xaa Xaa 

115 120 125 

Phe Xaa Leu Xaa Pro Xaa Xaa Pro Lys Trp Xaa Xaa Xaa Xaa Xaa Thr 

130 135 140 

Tyr Arg lie Xaa Asn Tyr Thr Pro Asp Leu Xaa Xaa Xaa Xaa Val Asp 
145 150 155 160 

Xaa Ala He Xaa Lys Ala Phe Xaa Val Trp Ser Xaa Val Thr Pro Leu 

165 170 175 

Xaa Phe Xaa Xaa Val Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Glu Gly 

180 185 190 

Xaa Ala Asp He Met He Xaa Phe Ala Xaa Xaa Glu His Gly Asp Xaa 

195 200 205 

Xaa Pro Phe Asp Gly Pro Gly Gly Xaa Leu Ala His Ala Phe Xaa Pro 

210 215 220 

Gly Pro Gly He Gly Gly Asp Ala His Phe Asp Asp Asp Glu Xaa Trp 
225 230 235 240 

Thr Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

245 250 255 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

260 265 270 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

275 280 285 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

290 295 300 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
305 310 315 320 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

325 330 335 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

340 345 350 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

355 360 365 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

370 375 380 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
385 390 395 400 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

405 410 415 

Xaa Xaa Xaa Xaa Xaa Gly Xaa Asn Leu Phe Leu Val Ala Ala His Glu 

420 425 430 

Xaa Gly His Ser Leu Gly Leu Xaa His Ser Xaa Asp Pro Xaa Ala Leu 

435 440 445 

Met Tyr Pro Xaa Tyr Xaa Xaa Phe Xaa Asp Xaa Xaa Xaa Phe Xaa Leu 

450 455 460 

Xaa Xaa Asp Asp lie Xaa Gly lie Gin Xaa Leu Tyr Gly Xaa Xaa Xaa 
465 470 475 480 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

485 490 495 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

500 505 510 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 



515 



520 



525 
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Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Pro Xaa Xaa Cys Asp Xaa Xaa Xaa 

530 535 540 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Asp Ala lie Xaa Xaa Xaa 
545 550 555 560 

Arg Gly Glu Xaa Phe Phe Phe Lys Asp Arg Phe Phe Trp Arg Xaa Xaa 

565 570 575 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu lie Xaa Xaa Phe 

580 585 590 

Trp Pro Xaa Leu Pro Xaa Xaa lie Asp Ala Ala Tyr Glu Xaa Xaa Xaa 

595 600 605 

Xaa Xaa Xaa Val Phe Phe Phe Lys Gly Xaa Xaa Tyr Trp Xaa Tyr Xaa 

610 615 620 

Gly Xaa Xaa Xaa Xaa Xaa Gly Tyr Pro Xaa Xaa He Xaa Xaa Xaa Leu 
625 630 635 640 

Gly Phe Pro Xaa Xaa Val Xaa Xaa He Asp Ala Ala Xaa Xaa Xaa Xaa 

645 650 655 

Xaa Xaa Xaa Xaa Lys Thr Tyr Phe Phe Xaa Xaa Xaa Xaa Tyr Trp Arg 

660 665 670 

Tyr Asp Glu Xaa Xaa Xaa Xaa Met Asp Pro Gly Tyr Pro Lys Xaa He 

675 680 685 

Xaa Xaa Xaa Phe Xaa Gly He Xaa Xaa Xaa Val Asp Ala Val Phe Xaa 

690 695 700 

Xaa Xaa Xaa Xaa Gly Phe Xaa Tyr Phe Phe Xaa Gly Xaa Xaa Xaa Tyr 
705 710 715 720 

Xaa Phe Xaa Xaa Xaa Xaa Xaa Xaa Val Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

725 730 735 

Xaa Xaa Xaa Xaa Trp Leu Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

740 745 750 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

755 760 765 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

770 775 780 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
785 790 795 800 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
805 810 815 

Xaa Xaa Xaa Xaa 
820 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



Pro Arg Cys Gly Val Thr Asp 
1 5 
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(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



Arg Lys Lys Arg 
1 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Lvs Ala Pro Thr Ser Thr Arg Phe Ser Asp Ala He Arg Ala Phe Gin 

1 5 io 15 

Trp Val Ser Gin Leu Pro Val Ser Gly Val Leu Asp Arg Ala Asn Leu 

20 25 30 

Arg Gin Met Thr Arg Pro Arg Cys Gly Val Thr Asp Thr Asn Ser Tyr 

35 40 45 

Ala Ala Trp Ala Glu Arg He Ser Asp Leu Phe Ala Arg His Arg Thr 

50 55 60 

Lys Met Arg Arg Lys Lys Arg Phe Ala Lys Gin Gly Asn Lys Trp Tyr 
65 70 75 80 

Lys Gin His Leu Ser Tyr Arg Leu Val Asn Trp Pro Glu His Leu Pro 

85 90 95 

Glu Pro Ala Val Arg Gly Ala Val Arg Ala Ala Phe Gin Leu Trp Ser 

100 105 HO 

Asn Val Ser Ala Leu Glu Phe Trp Glu Ala Pro Ala Thr Gly Pro Ala 

115 120 125 

Asp He Arg Leu Thr Phe Phe Gin Gly Asp His Asn Asp Gly Leu Gly 

130 135 140 

Asn Ala Phe Asp Gly Pro Gly Gly Ala Leu Ala His Ala Phe Leu Pro 
145 ~ 150 155 160 

Arg Arg Gly Glu Ala His Phe Asp Gin Asp Glu Arg Trp Ser Leu Ser 

165 170 175 

Arg Arg Arg Gly Arg Asn Leu Phe Val Val Leu Ala His Glu He Gly 

180 185 190 

His Thr Leu Gly Leu Thr His Ser Pro Ala Pro Arg Ala Leu Met Ala 
195 200 205 
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Pro Tyr Tyr Lys Arg Leu Gly Arg Asp Ala Leu Leu Ser Trp Asp Asp 

210 215 220 

Val Leu Ala Val Gin Ser Leu Tyr Gly Lys Pro Leu Gly Gly Ser Val 
225 230 235 240 

Ala Val Gin Leu Pro Gly Lys Leu Phe Thr Asp Phe Glu Thr Trp Asp 

245 250 255 

Ser Tyr Ser Pro Gin Gly Arg Arg Pro Glu Thr Gin Gly Pro Lys Tyr 

260 265 270 

Cys His Ser Ser Phe Asp Ala He Thr Val Asp Arg Gin Gin Gin Leu 

275 290 285 

Tyr He Phe Lys Gly Ser His Phe Trp Glu Val Ala Ala Asp Gly Asn 

290 295 300 

Val Ser Glu Pro Arg Pro Leu Gin Glu Arg Trp Val Gly Leu Pro Pro 
305 310 315 320 

Asn He Glu Ala Ala Ala Val Ser Leu Asn Asp Gly Asp Phe Tyr Phe 

325 330 335 

Phe Lys Gly Gly Arg Cys Trp Arg Phe Arg Gly Pro Lys Pro Val Trp 

340 34S 350 

Gly Leu Pro Gin Leu Cys Arg Ala Gly Gly Leu Pro Arg His Pro Asp 

355 360 365 

Ala Ala Leu Phe Phe Pro Pro Leu Arg Arg Leu He Leu Phe Lys Gly 

370 375 380 

Ala Arg Tyr Tyr Val Leu Ala Arg Gly Gly Leu Gin Val Glu Pro Tyr 
385 390 395 400 

Tyr Pro Arg Ser Leu Gin Asp Trp Gly Gly He Pro Glu Glu Val Ser 

405 410 415 

Gly Ala Leu Pro Arg Pro Asp Gly Ser He He Phe Phe Arg Asp Asp 

420 425 430 

Arg Tyr Trp Arg Leu Asp Gin Ala Lys Leu Gin Ala Thr Thr Ser Gly 

435 440 445 

Arg Trp Ala Thr Glu Leu Pro Trp Met Gly Cys Trp His Ala Asn Ser 

450 455 460 

Gly Ser Ala Leu Phe 
465 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Pro Arg Cys Gly Val Pro Asp 
1 5 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

His Glu Xaa Oly His Xaa Leu Gly Leu Xaa His Ser 
1 s 10 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:13: 

Gly Glu Gly Pro Leu Gly Leu Tyr Ala Lys Gly 
l 5 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 

Arg His Arg Thr Lys Met Arg Arg Lys Lys Arg Phe Ala Lys Gin Gly 
* " 5 10 15 

Asn 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: None. 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Phe Gin Gly Asp His Asn Asp Gly Leu Gly Asn Ala Phe Asp Gly 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 

Arg Ser Leu Gin Asp Trp Gly Gly He Pro Glu Glu Val Ser Gly Ala 

1 5 10 15 

Leu Pro Arg Pro Asp Gly Ser He He 
20 25 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Ala Thr Glu Leu Pro Trp Met Gly Cys Trp His Ala Asn Ser Gly Ser 

X 5 10 15 

Ala Leu Phe 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 508 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:18; 



Met Asn Cys Gin Gin Leu Trp Leu Gly Phe Leu Leu Pro Met Thr Val 
i 5 10 15 
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Ser Gly Arg Val Leu Gly Leu Ala Glu Val Ala Pro Val Asp Tyr Leu 

20 25 30 

Ser Gin Tyr Gly Tyr Leu Gin Lys Pro Leu Glu Gly Ser Asn Asn Phe 

35 40 45 

Lys Pro Glu Asp He Thr Glu Ala Leu Arg Ala Phe Gin Glu Ala Ser 

SO 55 60 

Glu Leu Pro Val Ser Gly Gin Leu Asp Asp Ala Thr Arg Ala Arg Met 
65 70 7S 60 

Arg Gin Pro Arg Cys Gly Leu Glu Asp Pro Phe Asn Gin Lys Thr Leu 

85 90 95 

Lys Tyr Leu Leu Leu Gly Arg Trp Arg Lys Lys His Leu Thr Phe Arg 

100 105 110 

He Leu Asn Leu Pro Ser Thr Leu Pro Pro His Thr Ala Arg Ala Ala 

115 120 125 

Leu Arg Gin Ala Phe Gin Asp Trp Ser Asn Val Ala Pro Leu Thr Phe 

130 135 140 

Gin Glu Val Gin Ala Gly Ala Ala Asp He Arg Leu Ser Phe His Gly 
145 150 155 160 

Arg Gin Ser Ser Tyr Cys Ser Asn Thr Phe Asp Gly Pro Gly Arg Val 

165 170 175 

Leu Ala His Ala Asp He Pro Glu Leu Gly Ser Val His Phe Asp Glu 

180 185 190 

Asp Glu Phe Trp Thr Glu Gly Thr Tyr Arg Gly Val Asn Leu Arg He 

195 200 205 

He Ala Ala His Glu Val Gly His Ala Leu Gly Leu Gly His Ser Arg 

210 215 220 

Tyr Ser Gin Ala Leu Met Ala Pro Val Tyr Glu Gly Tyr Arg Pro His 
225 230 235 240 

Phe Lys Leu His Pro Asp Asp Val Ala Gly He Gin Ala Leu Tyr Gly 

245 250 255 

Lys Lys Ser Pro Val He Arg Asp Glu Glu Glu Glu Glu Thr Glu Leu 

260 265 270 

Pro Thr Val Pro Pro Val Pro Thr Glu Pro Ser Pro Met Pro Asp Pro 

275 2B0 285 

Cys Ser Ser Glu Leu Asp Ala Met Met Leu Gly Pro Arg Gly Lys Thr 

290 295 300 

Tyr Ala Phe Lys Gly Asp Tyr Val Trp Thr Val Ser Asp Ser Gly Pro 
305 310 315 320 

Gly Pro Leu Phe Arg Val Ser Ala Leu Trp Glu Gly Leu Pro Gly Asn 

325 330 335 

Leu Asp Ala Ala Val Tyr Ser Pro Arg Thr Gin Trp He His Phe Phe 

340 345 350 

Lys Gly Asp Lys Val Trp Arg Tyr He Asn Phe Lys Met Ser Pro Gly 

355 360 365 

Phe Pro Lys Lys Leu Asn Arg Ser Glu Pro Asn Leu Asp Ala Ala Leu 

370 375 380 

Tyr Trp Pro Leu Asn Gin Lys Val Phe Leu Phe Lys Gly Ser Gly Tyr 
385 390 395 400 

Trp Gin Trp Asp Glu Leu Ala Arg Thr Asp Phe Ser Ser Tyr Pro Lys 

405 410 415 

Pro He Lys Gly Leu Phe Thr Gly Val Pro Asn Gin Pro Ser Ala Ala 

420 425 430 

Met Ser Trp Gin Asp Gly Arg Val Tyr Phe Phe Lys Gly Lys Val Tyr 
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470 * 
III Ser Gly Gly Asn Thr Thr Pro Ser Gly Thr Gly He Thr Leu Asp 

Thr Thr Leu Ser "a Thr Glu Thr Thr Phe Glu Tyr 
500 505 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 660 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19: 

Met Glu Ala Leu Met Ala Arg Gly Ala Leu Thr Gly Pro Leu Arg Ala 

L eu Cys Leu Leu Gly Cys Leu Leu Ser His Ala Ala Ala Ala Pro Ser 

Pro lie lie Lys Phe Pro Gly Asp Val Ala Pro Lys Thr Asp Lys Glu 

40 

L eu Ala Val Gin Tyr Leu Asn Thr Phe Tyr Gly Cys Pro Lys Glu Ser 
cys Asn Leu Phe Val Leu Lys Asp Thr Leu Lys Lys Met Gin Lys Phe 

70 

Phe Gly Leu Pro Gin Thr Gly Asp Leu Asp Gin Asn Thr He Glu Thr 

Met Arg Lys Pro Arg Cys Gly Asn Pro Asp Val Ala Asn Tyr Asn Phe 

Phe Pro Arg lys Pro Lys Trp Asp Lys Asn Gin lie Thr Tyr Arg He 

lie Gly Tyr Thr Pro Asp Leu Asp Pro Glu Thr Val Asp Asp Ala Phe 

135 140 
Ala Arg Ala Phe Gin Val Trp Ser Asp Val Thr Pro Leu Arg Phe Ser 

150 

£ lie His Asp Gly Glu Ala Asp He Met He Asn Phe Gly Arg Trp 

Glu His Gly Asp Oil Tyr Pro Phe Asp Gly Lys Asp Gly Leu Leu Ala 

180 185 
His Ala Phe Ala Pro Gly Thr Gly Val Gly Gly Asp Ser His Phe Asp 

Asp Asp III Leu Trp Thr Leu Z Glu Gly Gin Val Val Arg Val Lys 

Tyr III Asn Ala Asp Gly «. Tyr Cys Lys M. Pro Phe Leu Phe Asn 

"y Lys Glu Tyr Asn Ser Cys Thr Asp Thr Gly Arg Ser Asp Gly Phe 
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245 250 255 

Leu Trp Cys Ser Thr Thr Tyr Asn Phe Glu Lys Asp Gly Lys Tyr Gly 

Phe Cys Pro His Glu Ala Leu Phe Thr Met Gly Gly Asn Ala Glu Gly 

275 280 

Gin Pro Cys Lys Phe Pro Phe Arg Phe Gin Gly Thr Ser Tyr Asp Ser 

295 300 
cys Thr Thr Glu Gly Arg Thr Asp Gly Tyr Arg Trp Cys Gly Thr Thr 

G°iu Asp Tyr Asp Arg Asp Lys Ly S Tyr Gly Phe Cys Pro Glu Thr Ala 
325 

„et Ser Thr Val Gly Gly Asn Ser Glu Gly Ala Pro Cys Val Phe Pro 

Phe Thr Phe Leu Gly Asn Lys Tyr olu Ser Cys Thr Ser Ala Gly Arg 

ser Asp Gly Lys Met Trp Cys Ala Thr Thr Ala Asn Tyr Asp Asp Asp 

Arg Lys Trp Gly Phe Cys Pro Asp Gin Gly Tyr Ser Leu Phe Leu Val 

"a Ala His Glu Phe "ly His Ala Met Gly Leu Glu His Ser Gin Asp 

pro Gly Ala Leu Met Ala Pro lie Tyr Thr Tyr Thr Lys Asn Phe Arg 

420 425 
Leu Ser Gin Asp Asp He Lys Gly He Gin Glu Leu Tyr Gly Ala Ser 

440 44d 

Pro Asp He Asp Leu Gly Thr Gly Pro Thr Pro Thr Leu Gly Pro Val 

455 

Thr Pro Glu He Cys Lys Gin Asp He Val Phe Asp Gly He Ala Gin 

ne Arg Gly Glu He Phe Phe Phe Lys Asp Arg Phe lie Trp Arg Thr 

Val Thr Pro Arg Asp Lys Pro Met Gly Pro Leu Leu Val Ala Thr Phe 

Trp Pro Glu Leu Pro Glu Lys He As P Ala Val Tyr Glu Ala Pro Gin 
515 520 5 

Glu Glu Lys Ala Val Phe Phe Ala Gly Asn Glu Tyr Trp He Tyr Ser 

535 540 
Ala III Thr Leu Glu Arg Gly Tyr Pro Lys Pro Leu Thr Ser Leu Gly 
545 550 555 , 

Leu Pro Pro Asp Val Gin Arg Val Asp Ala Ala Phe Asn Trp Ser Lys 

S65 570 575 

Asn Lys Lys Thr Tyr He Phe Ala Gly Asp Lys Phe Trp Arg Tyr Asn 

580 585 
Glu Val Lys Lys Lys Met Asp Pro Gly Phe Pro Lys Leu He Ala Asp 

595 600 605 

Ala Trp Asn Ala He Pro Asp Asn Leu Asp Ala Val Val Asp Leu Gin 

610 615 620 

Gly Gly Gly His Ser Tyr Phe Phe Lys Gly Ala Tyr Tyr Leu Lys Leu 

HI Asn Gin Ser Leu Lys Ser Val Lys Phe Gly Ser He Lys Ser Asp 
645 650 655 

Trp Leu Gly Cys 
660 
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(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 707 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO-.20: 

Met Ser Leu Trp Gin Pro Leu Val Leu Val Leu Leu Val Leu Gly Cys 

Phe Ala Ala Pro Arg Gin Arg Gin Ser Thr Leu Val Leu Phe Pro 

Gly Asp Leu Arg Thr Asn Leu Thr ^sp Arg Gin Leu Ala Glu Glu Tyr 

Leu Tyr Arg Tyr Gly Tyr Thr Trg Val Ala Glu Met Arg Gly Glu Ser 

Lys Ser Leu Gly Pro Ala Leu Leu Leu Leu Gin Lys Gin Leu Ser Leu 

70 

"o Glu Thr Gly Glu Leu Asp Ser Ma Thr Leu Lys Ala Met Arg Thr 

Pro Arg Cys Gly Val Pro Asp Leu Gly Arg Phe Gin Thr Phe Glu Gly 

100 105 
Asp Leu Lys Trp His His His Asn He Thr Tyr Trp lie Gin Asn Tyr 

115 120 

Ser Glu Asp Leu Pro Arg Ala Val He Asp Asp Ala Phe Ala Arg Ala 

135 140 

Phe "la Leu Tr P Ser Ala Val Thr Pro Leu Thr Phe Thr Arg Val Tyr 

150 155 
Ser Arg Asp Ala Asp He Val He Gin Phe Gly Val Ala Glu Hi. Gly 
165 170 , 

" Asp Gly Tyr Pro Phe Asp Gly Lys Asp Gly Leu Leu Ala His Ala Phe 

1B0 185 
Pro Pro Gly Pro Gly He Gin Gly Asp Ala His Phe Asp Asp Asp Glu 

195 200 205 

Leu Trp Ser Leu Gly Lys Gly Val Val Val Pro Thr Arg Phe Gly Asn 

215 220 

Ala Asp Gly Ala Ala Cys His Phe Pro Phe He Phe Glu Gly Arg Ser 

230 235 * 

Tyr ser Ala Cys Thr Thr Asp Gly Arg Ser Asp Gly Leu Pro Trp Cys 

245 250 
Ser Thr Thr Ala Asn Tyr Asp Thr Asp Asp Arg Phe Gly Phe Cys Pro 

265 270 
ser Glu Arg Leu Tyr Thr Arg Asp Gly Asn Ala Asp Gly Lys Pro Cys 

275 280 

Gin Phe Pro Phe He Phe Gin Gly Gin Ser Tyr Ser Ala Cys Thr Thr 



ASP Gly Arg Ser Asp Gly Tyr Arg Trp Cys Ala Thr Thr Ala Asn Tyr 

310 315 o^u 

Asp Arg Asp Lys Leu Phe Gly Phe Cys Pro Thr Arg Ala Asp Ser Thr 
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325 330 335 

Val Met Gly Oly Asn Ser Ala Gly Glu Leu Cys Val Phe Pro Phe Thr 

340 345 350 

Phe Leu Gly Lys Glu Tyr Ser Thr Cys Thr Ser Glu Gly Arg Gly Asp 

355 360 365 

Gly Arg Leu Trp Cys Ala Thr Thr Ser Asn Phe Asp Ser Asp Lys Lys 

Trp Gly Phe Cys Pro Asp G ? ln Gly Tyr Ser Leu Phe Leu Val Ala Ala 

His Glu Phe Gly His Ala Leu Gly Leu Asp His Ser Ser Val Pro Glu 

405 410 
Ala Leu Met Tyr Pro Met Tyr Arg Phe Thr Glu Gly Pro Pro Leu His 

425 



Lys Asp Asp J" Asn Gly He Arg H~i~s Leu Tyr Gly Pro Arg Pro Glu 
Pro Glu Pro Arg Pro Pro Thr Thr Thr Thr Pro Gin Pro Thr Ala Pro 



455 



Pro inr Val Cys Pro Thr Gly Pro Pro Thr Val His Pro Ser Glu Arg 



470 



Gly Arg Gly Ser Arg Pro Gin Gly Pro Phe Leu lie Ala Asp Lys Trp 



550 



435 440 445 

Glu 
450 
Thr 

III Thr Ala Gly Pro Thr Gly Pro Pro Ser Ala Gly Pro Thr Gly Pro 

485 490 
Pro Thr Ala Gly Pro Ser Thr Ala Thr Thr Val Pro Leu Ser Pro Val 

500 50S 51 

Asp Asp Ala. Cys Asn Val Asn lie Phe Asp Ala He Ala Glu lie Gly 

515 520 
Asn Gin Leu Tyr Leu Phe Lys Asp Gly Lys Tyr Trp Arg Phe Ser Glu 
530 535 540 

III Ala Leu Pro Arg Lys Leu Asp Ser Val Phe Glu Glu Pro Leu Ser 

565 570 575 

Lys Lys Leu Phe Phe Phe Ser Gly Arg Gin Val Trp Val Tyr Thr Gly 

580 585 590 

Ala ser Val Leu Gly Pro Arg Arg Leu Asp Lys Leu Gly Leu Gly Ala 

59S 600 60S 

Aso Val Ala Gin Val Thr Gly Ala Leu Arg Ser Gly Arg Gly Lys Met 

610 "5 620 

Leu Leu Phe Ser Gly Arg Arg Leu Trp Arg Phe Asp Val Lys Ala Gin 
625 630 635 640 

Met Val Asp Pro Arg Ser Ala Ser Glu Val Asp Arg Met Phe Pro Gly 

645 650 655 

Val Pro Leu Asp Thr His Asp Val Phe Gin Tyr Arg Glu Lys Ala Tyr 

660 665 670 

Phe Cys Gin Asp Arg Phe Tyr Trp Arg Val Ser Ser Arg Ser Glu Leu 

675 680 685 

Asn Gin Val Asp Gin Val Gly Tyr Val Thr Tyr Asp lie Leu Gin Cys 

690 695 700 

Pro Glu Asp 
705 

(2) INFORMATION FOR SEQ ID NO: 21: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 477 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



Met 


Lys 


Ser 


Leu 


Pro 


He 


Leu 


Leu Leu 


Leu 


Cvs 


Val 


Ala Val Cvs 


Ser 


j. 








c 








10 






15 






Tyr 


Dm 


Leu 


Asp 


Glv 


Ala 


Ala Arg 


Glv 


Glu 


Asp 


Thr Ser Met 


Asn 








on 








25 








30 




T on 


Val 


will 


uyo 


l y t 


Leu 


Glu 


Asn Tyr 


Tvr 


Asp 


Leu 


Lvs Lvs Asn 


Val 
















An 








4 5 




Lys 


VJJ.I1 




Val 




Ara 


Lys 


Asn Ser 


Glv 


Pro 


Val 


Val Lvs Lvs 


He 




DO 
























Arg 


UlU 


Met 


Gin 


Lys 


Phe 


Leu 


Gly Leu 


GlU 


Val 


Thr 


Glv Lvs Leu 


Asp 


65 










/ U 














cm 
oil 


©er 


Asp 


inr 


T All 




v a x 


Met 




Pro 

*r JL \j 


Arg 


wys 


£!lt# Val P-r»r> 

oiy vol t*ro 


Asp 




























Val 


oiy 


His 


Phe 




Thr 


Phe 


Pro Gly 


He 


Pro 


Lys 


Trn Arcr T.vq 


Thr 

1.1X1. 








100 








105 








110 




His 


Leu 


Thr 


Tvr 

* Jr *■ 


Ara 


He 


Val 


Asn Tyr 


Thr 


Pro 


Asd 


Leu Pro Lvs 


Asp 






115 










120 








125 




Ala 


Val 


Asp 


Ser 


Ala 


Val 


Glu 


Lys Ala 


Leu 


Lys 


Val 


Trn Glu Glu 


Val 




130 










135 








140 






Thr 


Pro 


Leu 


Thr 


Phe 


Ser 


Arcr 


Leu Tyr 


Glu 


Glv 


Glu 


Ala Asd He 


Met 


145 










ISO 








155 






160 


lie 


Ser 


Phe 


Ala 


Val 


Arcr 


Glu 


His Gly 


Asp 


Phe 


Tvr 


Pro Phe Asp 


Glv 

Jr 










165 








170 






175 




Pro 


Gly 


Asn 


Val 


Leu 


Ala 


His 


Ala Tyr 


Ala 


Pro 


Gly 


Pro Gly He 


Asn 








180 








185 








190 




Gly 


Asp 


Ala 


His 


Phe 


Asp 


Asp 


Asp Glu 


Gin 


Trp 


Thr 


Lys Asp Thr 


Thr 






195 










200 








205 




Gly 


Thr 


Asn 


Leu 


Phe 


Leu 


Val 


Ala Ala 


His 


Glu 


He 


Gly His Ser 


Leu 




210 










215 








220 






Gly 


Leu 


Phe 


His 


Ser 


Ala 


Asn 


Thr Glu 


Ala 


Leu 


Met 


Tyr Pro Leu 


Tyr 


225 










230 








235 






240 


His 


Ser 


Leu 


Thr 


Asp 


Leu 


Thr 


Arg Phe 


Arg 


Leu 


Ser 


Gin Asp Asp 


He 










245 








250 






255 




Asn 


Gly 


He 


Gin 


Ser 


Leu 


Tyr 


Gly Pro 


Pro 


Pro 


Asp 


Ser Pro Glu 


Thr 








260 








265 








270 




Pro 


Leu 


Val 


Pro 


Thr 


Glu 


Pro 


Val Pro 


Pro 


Glu 


Pro 


Gly Thr Pro 


Ala 






275 










280 








285 




Asn 


Cys 


Asp 


Pro 


Ala 


Leu 


Ser 


Phe Asp 


Ala 


Val 


Ser 


Thr Leu Arg 


Gly 




290 










295 








300 






Glu 


He 


Leu 


He 


Phe 


Lys 


Asp 


Arg His 


Phe 


Trp 


Arg 


Lys Ser Leu 


Arg 


305 










310 








315 






320 


Lys 


Leu 


Glu 


Pro 


Glu 


Leu 


His 


Leu He 


Ser 


Ser 


Phe 


Trp Pro Ser 


Leu 










325 








330 






335 




Pro 


Ser 


Gly 


Val 


Asp 


Ala 


Ala 


Tyr Glu 


Val 


Thr 


Ser 


Lys Asp Leu 


Val 








340 








345 








350 
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Phe He Phe Lys Gly Asn Gin Phe Trp Ala He Arg Gly Asn Glu Val 

355 360 365 

Arg Ala Gly Tyr Pro Arg Gly He His Thr Leu Gly Phe Pro Pro Thr 

370 375 380 

Val Arg Lys He Asp Ala Ala He Ser Asp Lys Glu Lys Asn Lys Thr 

385 390 395 

Tyr Phe Phe Val Glu Asp Lys Tyr Trp Arg Phe Asp Glu Lys Arg Asn 

* 405 410 415 

Ser Met Glu Pro Gly Phe Pro Lys Gin He Ala Glu Asp Phe Pro Gly 

420 425 430 

He Asp Ser Lys He Asp Ala Val Phe Glu Glu Phe Gly Phe Phe Tyr 

435 440 445 

Phe Phe Thr Gly Ser Ser Gin Leu Glu Phe Asp Pro Asn Ala Lys Lys 

450 455 460 

Val Thr His Thr Leu Lys Ser Asn Ser Trp Leu Asn Cys 

465 470 475 

(2) INFORMATION FOR SBQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 476 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

Met Met His Leu Ala Phe Leu Val Leu Leu Cys Leu Pro Val Cys Ser 

5 10 15 

Ala Tyr Pro Leu Ser Gly Ala Ala Lys Glu Glu Asp Ser Asn Lys Asp 

20 25 3 ° 

Leu Ala Gin Gin Tyr Leu Glu Lys Tyr Tyr Asn Leu Glu Lys Asp Val 

35 40 45 

Lys Gin Phe Arg Arg Lys Asp Ser Asn Leu lie Val Lys Lys He Gin 
' 50 55 60 

t Gin Lys Phe Leu Gly Leu Glu Val 
65 7 ° 75 80 



Gly Met Gin Lys Phe Leu Gly Leu Glu Val Thr Gly Lys Leu Asp Thr 

65 70 75 80 

Asp Thr Leu Glu Val Met Arg Lys Pro Arg Cys Gly Val Pro Asp Val 

8S 90 95 

Gly His Phe Ser Ser Phe Pro Gly Met Pro Lys Trp Arg Lys Thr His 

100 105 HO 

Leu Thr Tyr Arg He Val Asn Tyr Thr Pro Asp Leu Pro Arg Asp Ala 

115 120 125 

Val Asp Ser Ala He Glu Lys Ala Leu Lys Val Trp Glu Glu Val Thr 

130 1 3 5 i4 ° 

Pro Leu Thr Phe Ser Arg Leu Tyr Glu Gly Glu Ala Asp He Met He 
145 150 155 160 

Ser Phe Ala Val Lys Glu His Gly Asp Phe Tyr Ser Phe Asp Gly Pro 

165 170 «5 

Gly His Ser Leu Ala His Ala Tyr Pro Pro Gly Pro Gly Leu Tyr Gly 

185 190 



180 
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Asp lie His Phe Asp Asp Asp Glu Lys Trp Thr Glu Asp Ala Ser Gly 

195 200 

Thr Asn Leu Phe Leu Val Ala Ala His Glu Leu Gly His Ser Leu Gly 



L eu III His Ser Ala Asn Thr Glu Ala Leu Met Tyr Pro Leu Tyr Asn 



215 



S« Phe Thr Glu Leu HI Gin Phe Arg Leu ser Gin Asp Asp Val Asn 



230 



235 



245 

Gly He Gin Ser Leu Tyr Gly Pro Pro Pro Ala Ser Thr Glu Glu Pro 



2S0 



260 - — 

Leu val Pro Thr Lys Ser Val Pro Ser Gly Ser Glu Met Pro Ala Lys 



265 270 



Cys Asp Pro Ala Leu Ser Phe Asp Ala He Ser Thr Leu Arg Gly Glu 



280 



285 



295 



300 



Tyr Phe Phe Lys Asp Arg Tyr Phe Trp Arg Arg Ser His Trp Asn 



III Glu Pro Glu Phe His Leu lie Ser Ala "phe Trp Pro Ser Leu Pro 



325 

Ser Tyr Leu Asp Ala Ala Tyr Glu Val Asn Ser Arg Asp Thr Val Phe 
3 40 345 J 

Gly Asn Glu Phe Trp Ala He Arg Gly Asn 

tcc 360 365 

Ala Gly Tyr Pro Arg Gly He His Thr Leu Gly Phe Pro Pro Thr lie 

, 70 375 380 

Arg Lys He Asp Ala Ala Val Ser Asp Lys Glu Lys Lys Lys Thr Tyr 
ia* 390 395 , n 

Phe Phe Ala Ala Asp Lys Tyr Trp Arg Phe Asp Glu Asn Ser Gin Ser 
405 410 415 , 

Met Glu Gin Gly Phe Pro Arg Leu He Ala Asp Asp Phe Pro Gly Val 

420 425 43 

Glu Pro Lys Val Asp Ala Val Leu Gin Ala Phe Gly Phe Phe Tyr Phe 

435 440 44 , 

Phe Ser Gly Ser Ser Gin Phe Glu Phe Asp Pro Asn Ala Arg Met Val 



330 "5 



He Phe Lys Gly Asn Glu Phe Trp Ala He Arg Gly Asn Glu Val Gin 



450 

Thr His He Leu Lys Ser Asn Ser Trp Leu His Cys 



455 460 



465 



470 475 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

Met His Ser Phe Pro Pro Leu Leu Leu Leu Leu Phe Trp Gly Val Val 

5 10 15 

Ser His Ser Phe Pro Ala Thr Leu Glu Thr Gin Glu Gin Asp Val Asp 

25 30 



20 
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Leu Val Gin Lys Tyr Leu Glu Lys Tyr Tyr Asn Leu Lys Asn Asp Gly 

35 40 45 

Arg Gin Val Glu Lys Arg Arg Asn Ser Gly Pro Val Val Glu Lys Leu 

50 55 60 

Lys Gin Met Gin Glu Phe Phe Gly Leu Lys Val Thr Gly Lys Pro Asp 
65 70 75 80 

Ala Glu Thr Leu Lys Val Met Lys Gin Pro Arg Cys Gly Val Pro Asp 

85 90 95 

Val Ala Gin Phe Val Leu Thr Glu Gly Asn Pro Arg Trp Glu Gin Thr 

100 105 no 

His Leu Thr Tyr Arg He Glu Asn Tyr Thr Pro Asp Leu Pro Arg Ala 

115 120 125 

Asp Val Asp His Ala He Glu Lys Ala Phe Gin Leu Trp Ser Asn Val 

130 13S 140 

Thr Pro Leu Thr Phe Thr Lys Val Ser Glu Gly Gin Ala Asp He. Met 
145 150 155 160 

He Ser Phe Val Arg Gly Asp His Arg Asp Asn Ser Pro Phe Asp Gly 

165 170 175 

Pro Gly Gly Asn Leu Ala His Ala Phe Gin Pro Gly Pro Gly He Gly 

180 185 190 

Gly Asp Ala His Phe Asp Glu Asp Glu Arg Trp Thr Asn Asn Phe Arg 

195 200 205 

Glu Tyr Asn Leu His Arg Val Ala Ala His Glu Leu Gly His Ser Leu 

210 215 220 

Gly Leu Ser His Ser Thr Asp He Gly Ala Leu Met Tyr Pro Ser Tyr 
225 230 235 240 

Thr Phe Ser Gly Asp Val Gin Leu Ala Gin Asp Asp He Asp Gly He 

245 250 255 

Gin Ala He Tyr Gly Arg Ser Gin Asn Pro Val Gin Pro He Gly Pro 

260 265 270 

Gin Thr Pro Lys Ala Cys Asp Ser Lys Leu Thr Phe Asp Ala He Thr 

275 280 285 

Thr He Arg Gly Glu Val Met Phe Phe Lys Asp Arg Phe Tyr Met Arg 

290 295 300 

Thr Asn Pro Phe Tyr Pro Glu Val Glu Leu Asn Phe He Ser Val Phe 
305 310 315 320 

Trp Pro Gin Leu Pro Asn Gly Leu Glu Ala Ala Tyr Glu Phe Ala Asp 

325 330 335 

Arg Asp Glu Val Arg Phe Phe Lys Gly Asn Lys Tyr Trp Ala Val Gin 

340 345 350 

Gly Gin Asn Val Leu His Gly Tyr Pro Lys Asp He Tyr Ser Ser Phe 

355 360 365 

Gly Phe Pro Arg Thr Val. Lys His He Asp Ala Ala Leu Ser Glu Glu 

370 375 380 

Asn Thr Gly Lys Thr Tyr Phe Phe Val Ala Asn Lys Tyr Trp Arg Tyr 
385 390 395 400 

Asp Glu Tyr Lys Arg Ser Met Asp Pro Gly Tyr Pro Lys Met He Ala 

405 410 415 

His Asp Phe Pro Gly He Gly His Lys Val Asp Ala Val Phe Met Lys 

420 425 430 

Asp Gly Phe Phe Tyr Phe Phe His Gly Thr Arg Gin Tyr Lys Phe Asp 

435 440 445 

Pro Lys Thr Lys Arg He Leu Thr Leu Gin Lys Ala Asn Ser Trp Phe 
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450 455 460 

Asn Cys Arg Lys Asn 
465 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 467 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

Met Phe Ser Leu Lys Thr Leu Pro Phe Leu Leu Leu Leu His Val Gin 

! 5 10 15 

lie Ser Lys Ala Phe Pro Val Ser Ser Lys Glu Lys Asn Thr Lys Thr 

20 25 30 

Val Gin Asp Tyr Leu Glu Lys Phe Tyr Gin Leu Pro Ser Asn Gin Tyr 

35 40 45 

Gin Ser Thr Arg Lys Asn Gly Thr Asn Val lie Val Glu Lys Leu Lys 

50 55 60 

Glu Met Gin Arg Phe Phe Gly Leu Asn Val Thr Gly Lys Pro Asn Glu 
65 70 75 80 

Glu Thr Leu Asp Met Met Lys Lys Pro Arg Cys Gly Val Pro Asp Ser 

85 90 95 

Gly Gly Phe Met Leu Thr Pro Gly Asn Pro Lys Trp Glu Arg Thr Asn 

100 105 110 

Leu Thr Tyr Arg lie Arg Asn Tyr Thr Pro Gin Leu Ser Glu Ala Glu 

115 120 125 

Val Glu Arg Ala lie Lys Asp Ala Phe Glu Leu Trp Ser Val Ala Ser 

130 135 140 

Pro Leu lie Phe Thr Arg lie Ser Gin Gly Glu Ala Asp lie Asn lie 
145 150 155 160 

Ala Phe Tyr Gin Arg Asp His Gly Asp Asn Ser Pro Phe Asp Gly Pro 

165 170 175 

Asn Gly He Leu Ala His Ala Phe Gin Pro Gly Gin Gly He Gly Gly 

180 185 190 

Asp Ala His Phe Asp Ala Glu Glu Thr Trp Thr Asn Thr Ser Ala Asn 

195 200 205 

Tyr Asn Leu Phe Leu Val Ala Ala His Glu Phe Gly His Ser Leu Gly 

210 215 220 

Leu Ala His Ser Ser Asp Pro Gly Ala Leu Met Tyr Pro Asn Tyr Ala 
225 230 235 240 

Phe Arg Glu Thr Ser Asn Tyr Ser Leu Pro Gin Asp Asp He Asp Gly 

245 250 255 

He Gin Ala He Tyr Gly Leu Ser Ser Asn Pro He Gin Pro Thr Gly 

260 265 270 

Pro Ser Thr Pro Lys Pro Cys Asp Pro Ser Leu Thr Phe Asp Ala He 

275 280 285 

Thr Thr Leu Arg Gly Glu He Leu Phe Phe Lys Asp Arg Tyr Phe Trp 
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290 


















300 




Arg Arg 


His 


Pro 


vjin 


Levi 


VJ XH 




Val 
v a x 


Gl n 
vjrxu 


Mp t~ Ann PH*» 

l it- v, i\Sll It I 1C 


T1 « Cat* Leu 

lie OCi. ucu 


*i n c 

3 0b 


















315 


320 


Phe Trp 


Pro 


Ser 


Levi 


Pro 


Thr 


Glv 


He 


Gin 


Ala Ala Tvr* 

nia nxo *j^j> 


Glu Asn Phe 






J « D 










330 




335 


Asp Arg 


Asp 


Leu 


lie 


DVta 
rue 




file 




Glv 


Ann Gin Tvr 
noli vjixii i yi. 


Tm Ala Leu 




340 
















350 


Ser Gly 


Tyr Asp 


ti- 
ne 


Leu 


vvin 




Tyr 


Pro 


r we Ann T 1 a 
Lyo nap x ic 


Cpr* Ann Tvr 
ocx nou * Jr * 


355 










360 






365 




Gly Phe 


Pro 


Ser 




val 


Gin 

O XII 


Ala 


He 


Asp 


Ala Ala Val 


Phe Tvr Arq 


370 










375 








380 




Ser Lys 


Thr Tyr 


Phe 


Phe 


Val 


A Qn 


Ann 


Gin 


PHe Tm Ara 


Tvr Asd Asn 


385 








390 










395 


400 


Gin Arg 


Gin 


Phe 


Met 


Glu 


Pro 


Glv 


Tvr 


Pro 


Lys Ser He 


Ser Gly Ala 






405 










410 




415 


Phe Pro 


Gly 


He 


Glu 


Ser 


Lys 


Val 


Asp 


Ala 


Val Phe Gin 


Gin Glu His 






420 










425 






430 


Phe Phe 


His 


Val 


Phe Ser Gly 


Pro 


Arg 


Tyr 


Tyr Ala Phe 


Asp Leu He 




435 










440 






445 




Ala Gin 


Arg 


Val 


Thr Arg Val 


Ala 


Arg 


Gly 


Asn Lys Trp 


Leu Asn Cys 


450 










455 








460 




Arg Tyr 


Gly 





















465 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



Met His 


Pro 


Gly 


Val 


Leu 


Ala 


Ala Phe 


Leu 


Phe 


Leu Ser 


Trp Thr His 


1 






5 








10 






15 


Cys Arg 


Ala 


Leu 


Pro 


Leu 


Pro 


Ser Gly 


Gly 


Asp 


Glu Asp 


Asp Leu Ser 






20 








25 








30 


Glu Glu 


Asp 


Leu 


Gin 


Phe 


Ala 


Glu Arg 


Tyr 


Leu 


Arg Ser 


Tyr Tyr His 




35 










40 






45 




Pro Thr 


Asn 


Leu 


Ala 


Gly 


He 


Leu Lys 


Glu 


Asn 


Ala Ala 


Ser Ser Met 


50 










55 








60 




Thr Glu 


Arg 


Leu 


Arg 


Glu 


Met 


Gin Ser 


Phe 


Phe 


Gly Leu 


Glu Val Thr 


65 








70 








75 




80 


Gly Lys 


Leu 


Asp 


Asp 


Asn 


Thr 


Leu Asp 


Val 


Met 


Lys Lys 


Pro Arg Cys 








85 








90 






95 


Gly Val 


Pro 


Asp 


Val 


Gly 


Glu 


Tyr Asn 


Val 


Phe 


Pro Arg 


Thr Leu Lys 






100 








105 








110 


Trp Ser 


Lys 


Met 


Asn 


Leu 


Thr 


Tyr Arg 


lie 


Val 


Asn Tyr 


Thr Pro Asp 




115 










120 






125 




Met Thr 


His 


Ser 


Glu 


Val 


Glu 


Lys Ala 


Phe 


Lys 


Lys Ala 


Phe Lys Val 
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130 135 140 

Trp Ser Asp Val Thr Pro Leu Asn Phe Thr Arg Leu His Asp Gly He 
145 150 155 160 

Ala Asp He Met lie Ser Phe Gly He Lys Glu His Gly Asp Phe Tyr 

165 170 175 

Pro Phe Asp Gly Pro Ser Gly Leu Leu Ala His Ala Phe Pro Pro Gly 

180 185 190 

Pro Asn Tyr Gly Gly Asp Ala His Phe Asp Asp Asp Glu Thr Trp Thr 

195 200 205 

Ser Ser Ser Lys Gly Tyr Asn Leu Phe Leu Val Ala Ala His Glu Phe 

210 215 220 

Gly His Ser Leu Gly Leu Asp His Ser Lys Asp Pro Gly Ala Leu Met 
225 230 235 240 

Phe Pro He Tyr Thr Tyr Thr Gly Lys Ser His Phe Met Leu Pro Asp 

245 250 255 

Asp Asp Val Gin Gly He Gin Ser Leu Tyr Gly Pro Gly Asp Glu Asp 

260 265 270 

Pro Asn Pro Lys His Pro Lys Thr Pro Asp Lys Cys Asp Pro Ser Leu 

275 280 285 

Ser Leu Asp Ala He Thr Ser Leu Arg Gly Glu Thr Met He Phe Lys 
290 295 300 

- Asp Arg Phe_Phe„Trp Arg Leu_His_Prp_Gln _Gln_Val As p Ala Glu L eu 



305 310 315 320 

Phe Leu Thr Lys Ser Phe Trp Pro Glu Leu Pro Asn Arg He Asp Ala 

325 330 335 

Ala Tyr Glu His Pro Ser His Asp Leu He Phe He Phe Arg Gly Arg 

340 345 350 

Lys Phe Trp Ala Leu Asn Gly Tyr Asp He Leu Glu Gly Tyr Pro Lys 

355 360 365 

Lys He Ser Glu Leu Gly Leu Pro Lys Glu Val Lys Lys He Ser Ala 

370 375 380 

Ala Val His Phe Glu Asp Thr Gly Lys Thr Leu Leu Phe Ser Gly Asn 
385 390 395 400 

Gin Val Trp Arg Tyr Asp Asp Thr Asn His He Met Asp Lys Asp Tyr 

405 410 415 

Pro Arg Leu He Glu Glu Asp Phe Pro Gly He Gly Asp Lys Val Asp 

420 425 430 

Ala Val Tyr Glu Lys Asn Gly Tyr He Tyr Phe Phe Asn Gly Pro He 

435 440 445 

Gin Phe Glu Tyr Ser He Trp Ser Asn Arg He Val Arg Val Met Pro 

450 455 460 

Ala Asn Ser He Leu Trp Cys 
465 470 

(2) INFORMATION FOR SEQ ID NO: 26: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 470 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: None 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

Met Lys Phe Leu Leu He Leu Leu Leu Gin Ala Thr Ala Ser Gly Ala 

15 io 15 

Leu Pro Leu Asn Ser Ser Thr Ser Leu Glu Lys Asn Asn Val Leu Phe 

20 25 30 

Gly Glu Arg Tyr Leu Glu Lys Phe Tyr Gly Leu Glu He Asn Lys Leu 

35 40 45 

Pro Val Thr Lys Met Lys Tyr Ser Gly Asn Leu Met Lys Glu Lys He 

50 55 60 

Gin Glu Met Gin His Phe Leu Gly Leu Lys Val Thr Gly Gin Leu Asp 
65 70 75 80 

Thr Ser Thr Leu Glu Met Met His Ala Pro Arg Cys Gly Val Pro Asp 

as 90 95 

Val His His Phe Arg Glu Met Pro Gly Gly Pro Val Trp Arg Lys His 

100 105 * no 

Tyr He Thr Tyr Arg He Asn Asn Tyr Thr Pro Asp Met Asn Arg Glu 

115 120 125 

Asp Val Asp Tyr Ala lie Arg Lys Ala Phe Gin Val Trp Ser Asn Val 

130 135 140 

Thr Pro Leu Lys Phe Ser Lys He Asn Thr Gly Met Ala Asp He Leu 
145 150 155 ™ 160 

Val Val Phe Ala Arg Gly Ala His Gly Asp Phe His Ala Phe Asp Gly 

165 170 175 

Lys Gly Gly He Leu Ala His Ala Phe Gly Pro Gly Ser Gly He Gly 

180 185 190 

Gly Asp Ala His Phe Asp Glu Asp Glu Phe Trp Thr Thr His Ser Gly 

195 200 205 

Gly Thr Asn Leu Phe Leu Thr Ala Val His Glu He Gly His Ser Leu 

210 215 220 

Gly Leu Gly His Ser Ser Asp Pro Lys Ala Val Met Phe Pro Thr Tyr 
225 230 235 2 40 

Lys Tyr Val Asp He Asn Thr Phe Arg Leu Ser Ala Asp Asp He Arg 

245 250 * 255 

Gly He Gin Ser Leu Tyr Gly Asp Pro Lys Glu Asn Gin Arg Leu Pro 

260 265 270 

Asn Pro Asp Asn Ser Glu Pro Ala Leu Cys Asp Pro Asn Leu Ser Phe 

275 280 285 

Asp Ala Val Thr Thr Val Gly Asn Lys He Phe Phe Phe Lys Asp Arg 

290 295 300 

Phe Phe Trp Leu Lys Val Ser Glu Arg Pro Lys Thr Ser Val Asn Leu 
305 310 315 320 

He Ser Ser Leu Trp Pro . Thr Leu Pro Ser Gly He Glu Ala Ala Tyr 

325 330 335 

Glu He Glu Ala Arg Asn Gin Val Phe Leu Phe Lys Asp Asp Lys Tyr 

340 345 * 350 

Trp Leu He Ser Asn Leu Arg Pro Glu Pro Asn Tyr Pro Lys Ser He 

355 360 365 

His Ser Phe Gly Phe Pro Asn Phe Val Lys Lys He Asp Ala Ala Val 

370 375 380 

Phe Asn Pro Arg Phe Tyr Arg Thr Tyr Phe Phe Val Asp Asn Gin Tyr 
385 390 395 4 oo 

Trp Arg Tyr Asp Glu Arg Arg Gin Met Met Asp Pro Gly Tyr Pro Lys 
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405 410 415 

Leu He Thr Lys Asn Phe Gin Gly He Gly Pro Lys He Asp Ala Val 

420 425 430 

Phe Tyr Ser Lys Asn Lys Tyr Tyr Tyr Phe Phe Gin Gly Ser Asn Gin 

435 440 445 

Phe Glu Tyr Asp Phe Leu Leu Gin Arg He Thr Lys Thr Leu Lys Ser 

450 455 460 

Asn Ser Trp Phe Gly Cys 
465 470 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 267 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

Met Arg Leu Thr Val Leu Cys Ala Val Cys Leu Leu Pro Gly Ser Leu 

1 5 10 15 

Ala Leu Pro Leu Pro Gin Glu Ala Gly Gly Met Ser Glu Leu Gin Trp 

20 25 30 

Glu Gin Ala Gin Asp Tyr Leu Lys Arg Phe Tyr Leu Tyr Asp Ser Glu 

35 40 45 

Thr Lys Asn Ala Asn Ser Leu Glu Ala Lys Leu Lys Glu Met Gin Lys 

50 55 60 . 

Phe Phe Gly Leu Pro He Thr Gly Met Leu Asn Ser Arg Val He Glu 
65 70 75 . 80 

He Met Gin Lys Pro Arg Cys Gly Val Pro Asp Val Ala Glu Tyr Ser 

85 90 95 

Leu Phe Pro Asn Ser Pro Lys Trp Thr Ser Lys Val Val Thr Tyr Arg 

100 105 110 

He Val Ser Tyr Thr Arg Asp Leu Pro His He Thr Val Asp Arg Leu 

115 120 125 

Val Ser Lys Ala Leu Asn Met Trp Gly Lys Glu He Pro Leu His Phe 

130 135 140 

Arg Lys Val Val Trp Gly Thr Ala Asp He Met He Gly Phe Ala Arg 
145 150 155 160 

Gly Ala His Gly Asp Ser Tyr Pro Phe Asp Gly Pro Gly Asn Thr Leu 

165 170 175 

Ala His Ala Phe Ala Pro Gly Thr Gly Leu Gly Gly Asp Ala His Phe 

180 185 190 

Asp Glu Asp Glu Arg Trp Thr Asp Gly Ser Ser Leu Gly He Asn Phe 

195 200 205 

Leu Tyr Ala Ala Thr His Glu Leu Gly His Ser Leu Gly Met Gly His 

210 215 220 

Ser Ser Asp Pro Asn Ala Val Met Tyr Pro Thr Tyr Gly Asn Gly Asp 
225 230 235 240 

Pro Gin Asn Phe Lys Leu Ser Gin Asp Asp He Lys Gly lie Gin Lys 
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245 250 255 

Leu Tyr Gly Lys Arg Ser Asn Ser Arg Lys Lys 
260 265 

(2) INFORMATION FOR SEQ ID NO: 28: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 582 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

Met Ser Pro Ala Pro Arg Pro Ser Arg Cys Leu Leu Leu Pro Leu Leu 

15 10 15 

Thr Leu Gly Thr Ala Leu Ala Ser Leu Gly Ser Ala Gin Ser Ser Ser 

20 25 30 

Phe Ser Pro Glu Ala Trp Leu Gin Gin Tyr Gly Tyr Leu Pro Pro Gly 

35 40 45 

Asp Leu Arg Thr His Thr Gin Arg Ser Pro Gin Ser Leu Ser Ala Ala 

50 55 60 

lie Ala Ala Met Gin Lys Phe Tyr Gly Leu Gin Val Thr Gly Lys Ala 
65 70 75 80 

Asp Ala Asp Thr Met Lys Ala Met Arg Arg Pro Arg Cys Gly Val Pro 

85 90 95 

Asp Lys Phe Gly Ala Glu lie Lys Ala Asn Val Arg Arg Lys Arg Tyr 

100 105 110 

Ala lie Gin Gly Leu Lys Trp Gin His Asn Glu lie Thr Phe Cys lie 

115 120 125 

Gin Asn Tyr Thr Pro Lys Val Gly Glu Tyr Ala Thr Tyr Glu Ala lie 

130 135 140 

Arg Lys Ala Phe Arg Val Trp Glu Ser Ala Thr Pro Leu Arg Phe Arg 
145 150 155 160 

Glu Val Pro Tyr Ala Tyr lie Arg Glu Gly His Glu Lys Gin Ala Asp 

165 170 175 

lie Met lie Phe Phe Ala Glu Gly Phe His Gly Asp Ser Thr Pro Phe 

180 185 190 

Asp Gly Glu Gly Gly Phe Leu Ala His Ala Tyr Phe Pro Gly Pro Asn 

195 200 205 

lie Gly Gly Asp Thr His Phe Asp Ser Ala Glu Pro Trp Thr Val Arg 

210 215 220 

Asn Glu Asp Leu Asn Gly Asn Asp He Phe Leu Val Ala Val His Glu 
225 230 235 240 

Leu Gly His Ala Leu Gly Leu Glu His Ser Ser Asp Pro Ser Ala He 

245 250 255 

Met Ala Pro Phe Tyr Gin Trp Met Asp Thr Glu Asn Phe Val Leu Pro 

260 265 270 

Asp Asp Asp Arg Arg Gly He Gin Gin Leu Tyr Gly Gly Glu Ser Gly 

275 280 285 

Phe Pro Thr Lys Met Pro Pro Gin Pro Arg Thr Thr Ser Arg Pro Ser 
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NOT FURNISHED UPON FILING 



NO PRESENT ADO(A) EN EL MOMENTO DE LA PRESENT ACION 



NON SOUMIS(E) AU MOMENT DU DEPOT 
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20 25 30 

Leu Leu Val Leu Leu Gly Cys Leu Gly Leu Gly Val Ala Ala Glu Asp 

35 40 45 

Ala Glu Val His Ala Glu Asn Trp Leu Arg Leu Tyr Gly Tyr Leu Pro 

50 55 60 

Gin Pro Ser Arg His Met Ser Thr Met Arg Ser Ala Gin He Leu Ala 
65 70 75 80 

Ser Ala Leu Ala Glu Met Gin Arg Phe Tyr Gly He Pro Val Thr Gly 

85 90 95 

Val Leu Asp Glu Glu Thr Lys Glu Trp Met Lys Arg Pro Arg Cys Gly 

100 105 no 

Val Pro Asp Gin Phe Gly Val Arg Val Lys Ala Asn Leu Arg Arg Arg 

115 120 125 

Arg Lys Arg Tyr Ala Leu Thr Gly Arg Lys Trp Asn Asn His His Leu 

130 135 140 

Thr Phe Ser He Gin Asn Tyr Thr Glu Lys Leu Gly Trp Tyr His Ser 
145 150 155 160 

Met Glu Ala Val Arg Arg Ala Phe Arg Val Trp Glu Gin Ala Thr Pro 

165 170 175 

Leu Val Phe Gin Glu Val Pro Tyr Glu Asp He Arg Leu Arg Arg Gin 

180 185 190 

Lys Glu Ala Asp He Met Val Leu Phe Ala Ser Gly Phe His Gly Asp 

195 200 205 

Ser Ser Pro Phe Asp Gly Thr Gly Gly Phe Leu Ala His Ala Tyr Phe 

210 215 220 

Pro Gly Pro Gly Leu Gly Gly Asp Thr His Phe Asp Ala Asp Glu Pro 
225 230 235 240 

Trp Thr Phe Ser Ser Thr Asp Leu His Gly Asn Asn Leu Phe Leu Val 

245 250 255 

Ala Val His Glu Leu Gly His Ala Leu Gly Leu Glu His Ser Ser Asn 

260 265 270 

Pro Asn Ala He Met Ala Pro Phe Tyr Gin Trp Lys Asp Val Asp Asn 

275 280 28S 

Phe Lys Leu Pro Glu Asp Asp Leu Arg Gly He Gin Gin Leu Tyr Gly 

290 295 300 

Thr Pro Asp Gly Gin Pro Gin Pro Thr Gin Pro Leu Pro Thr Val Thr 
305 310 315 320 

Pro Arg Arg Pro Gly Arg Pro Asp His Arg Pro Pro Arg Pro Pro Gin 

325 330 335 

Pro Pro Pro Pro Gly Gly Lys Pro Glu Arg Pro Pro Lys Pro Gly Pro 

340 345 350 

Pro Val Gin Pro Arg Ala Thr Glu Arg Pro Asp Gin Tyr Gly Pro Asn 

355 360 365 

lie Cys Asp Gly Asp Phe Asp Thr Val Ala Met Leu Arg Gly Glu Met 

370 375 380 

Phe Val Phe Lys Gly Arg Trp Phe Trp Arg Val Arg His Asn Arg Val 
385 390 395 400 

Leu Asp Asn Tyr Pro Met Pro He Gly His Phe Trp Arg Gly Leu Pro 

405 410 415 

Gly Asp He Ser Ala Ala Tyr Glu Arg Gin Asp Gly Arg Phe Val Phe 

420 425 430 

Phe Lys Gly Asp Arg Tyr Trp Leu Phe Arg Glu Ala Asn Leu Glu Pro 
435 440 445 
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Gly Tyr Pro Gin Pro Leu Thr Ser Tyr Gly Leu Gly lie Pro Tyr Asp 

450 455 460 

Arg lie Asp Thr Ala He Trp Trp Glu Pro Thr Gly His Thr Phe Phe 
465 470 475 480 

Phe Gin Glu Asp Arg Tyr Trp Arg Phe Asn Glu Glu Thr Gin Arg Gly 

485 490 495 

Asp Pro Gly Tyr Pro Lys Pro He Ser Val Trp Gin Gly He Pro Ala 

500 505 510 

Ser Pro Lys Gly Ala Phe Leu Ser Asn Asp Ala Ala Tyr Thr Tyr Phe 

515 520 525 

Tyr Lys Gly Thr Lys Tyr Trp Lys Phe Asp Asn Glu Arg Leu Arg Met 

530 535 540 

Glu Pro Gly Tyr Pro Lys Ser He Leu Arg Asp Phe Met Gly Cys Gin 
545 550 555 560 

Glu His Val Glu Pro Gly Pro Arg Trp Pro Asp Val Ala Arg Pro Pro 

565 570 575 

Phe Asn Pro His Gly Gly Ala Glu Pro Gly Ala Asp Ser Ala Glu Gly 

580 585 * 590 

Asp Val Gly Asp Gly Asp Gly Asp Phe Gly Ala Gly Val Asn Lys Asp 

595 600 605 

Gly Gly Ser Arg Val Val Val Gin Met Glu Glu Val Ala Arg Thr Val 

610 615 620 

Asn Val Val Met Val Leu Val Pro Leu Leu Leu Leu Leu Cys Val Leu 
625 630 635 640 

Gly Leu Thr Tyr Ala Leu Val Gin Met Gin Arg Lys Gly Ala Pro Arg 

645 650 655 

Val Leu Leu Tyr Cys Lys Arg Ser Leu Gin Glu Trp Val 
660 665 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 604 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE : None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 

Met He Leu Leu Thr Phe Ser Thr Gly Arg Arg Leu Asp Phe Val His 

1 5 io 15 

His Ser Gly Val Phe Phe Leu Gin Thr Leu Leu Trp He Leu Cys Ala 

20 25 30 

Thr Val Cys Gly Thr Glu Gin Tyr Phe Asn Val Glu Val Trp Leu Gin 

35 40 45 

Lys Tyr Gly Tyr Leu Pro Pro Thr Ser Pro Arg Met Ser Val Val Arg 

50 55 60 

Ser Ala Glu Thr Met Gin Ser Ala Leu Ala Ala Met Gin Gin Phe Tyr 
65 70 75 80 

Gly lie Asn Met Thr Gly Lys Val Asp Arg Asn Thr lie Asp Trp Met 
85 90 95 
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Lys Lys Pro Arg Cys Gly Val Pro Asp Gin Thr Arg Gly Ser Ser Lys 

100 105 110 

Phe His He Arg Arg Lys Arg Tyr Ala Leu Thr Gly Gin Lys Trp Gin 

115 120 12s 

His Lys His He Thr Tyr Ser He Lys Asn Val Thr Pro Lys Val Gly 

130 135 140 

Asp Pro Glu Thr Arg Lys Ala He Arg Arg Ala Phe Asp Val Trp Gin 
145 "0 iss iso 

Asn val Thr Pro Leu Thr Phe Glu Glu Val Pro Tyr Ser Glu Leu Glu 

1«5 170 17S 

Asn Gly Lys Arg Asp Val Asp He Pro He He Phe Ala Ser Gly Phe 

180 iss 190 

His Gly Asp Ser Ser Pro Phe Asp Gly Glu Gly Gly Phe Leu Ala His 

195 200 205 

Ala Tyr Phe Pro Gly Pro Gly He Gly Gly Asp Thr His Phe Asp Ser 

210 215 220 

Asp Glu Pro Trp Thr Leu Gly Asn Pro Asn His Asp Gly Asn Asp Leu 
225 "0 235 240 

Phe Leu Val Ala Val His Glu Leu Gly His Ala Leu Gly Leu Glu His 

24S 250 ' 255 

Ser Asn Asp Pro Thr Ala He Met Ala Pro Phe Tyr Gin Tyr Met Glu 

2 <5° 265 270 

Gin Thr Leu Gin Leu Pro Asn Asp Asp Tyr Arg His Gin Arg Tyr Met 

275 280 285 

Ser Pro Asp Lys He Pro Pro Pro Thr Arg Pro Leu Pro Thr Val Pro 

290 295 3 oo 

Pro His Arg Ser He Pro Pro Ala Asp Pro Arg Lys Asn Asp Arg Pro 
305 31° 315 320 

Lys Pro Pro Arg Pro Pro Thr Gly Arg Pro Ser Tyr Pro Gly Ala Lys 

325 330 335 

Pro Asn He Cys Asp Gly Asn Phe Asn Thr Leu Ala He Leu Arg Arg 

340 345 3 S o . 

Glu Met Phe Val Phe Lys Asp Gin Trp Phe Trp Arg Val Arg Asn Asn 

355 360 365 

Arg Val Met Asp Gly Tyr Pro Met Gin He Thr Tyr Phe Trp Arg Gly 

370 375 3 8 o 

Leu Pro Pro Ser He Asp Ala Val Tyr Glu Asn Ser Asp Gly Asn Phe 
385 390 395 400 

Val Phe Phe Lys Gly Asn Lys Tyr Trp Val Phe Lys Asp Thr Thr Leu 

405 410 41S 

Gin Pro Gly Tyr Pro His Asp Leu He Thr Leu Gly Ser Gly He Pro 

420 425 430 

Pro His Gly He Asp Ser Ala He Trp Trp Glu Asp Val Gly Lys Thr 

*3S 440 445 

Tyr Phe Phe Lys Gly Asp Arg Tyr Trp Arg Tyr Ser Glu Glu Met Lys 

450 455 460 

Thr Met Asp Pro Gly Tyr Pro Lys Pro He Thr Val Trp Lys Gly He 
465 470 4 ?S 480 

Pro Glu Ser Pro Gin Gly Ala Phe Val His Lys Glu Asn Gly Phe Thr 

48S 4 90 495 

Tyr Phe Tyr Lys Glu Gly Val Leu Glu He Gin Thr Thr Arg Tyr Ser 

500 505 510 

Arg Leu Glu Pro Gly His Pro Arg Ser He Leu Lys Asp Leu Ser Gly 



WO 98/40475 



PCT/US98/04694 



84 



515 520 525 

Cys Asp Gly Pro Thr Asp Arg Val Lys Glu Gly His Ser Pro Pro Asp 

530 535 540 

Asp Val Asp He Val He Lys Leu Asp Asn Thr Ala Ser Thr Val Lys 
545 * 550 555 560 

Ala He Ala He Val He Pro Cys He Leu Ala Leu Cys Leu Leu Val 

565 570 575 

Leu Val Tyr Thr Val Phe Gin Phe Lys Arg Lys Gly Thr Pro Arg His 

580 585 590 

He Leu Tyr Cys Lys Arg Ser Met Gin Glu Trp Val 
595 600 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 488 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 

Met Ala Pro Ala Ala Trp Leu Arg Ser Ala Ala Ala Arg Ala Leu Leu 

! 5 10 15 

Pro Pro Met Leu Leu Leu Leu Leu Gin Pro Pro Pro Leu Leu Ala Arg 

20 25 30 

Ala Leu Pro Pro Asp Val His His Leu His Ala Glu Arg Arg Gly Pro 

35 40 . 45 

Gin Pro Trp His Ala Ala Leu Pro Ser Ser Pro Ala Pro Ala Pro Ala 

50 55 60 

Thr Gin Glu Ala Pro Arg Pro Ala Ser Ser Leu Arg Pro Pro Arg Cys 
65 70 75 80 

Gly Val Pro Asp Pro Ser Asp Gly Leu Ser Ala Arg Asn Arg Gin Lys 

85 90 95 

Arg Phe Val Leu Ser Gly Gly Arg Trp Glu Lys Thr Asp Leu Thr Tyr 

100 105 HO 

Arg He Leu Arg Phe Pro Trp Gin Leu Val Gin Glu Gin Val Arg Gin 

US 120 125 

Thr Met Ala Glu Ala Leu Lys Val Trp Ser Asp Val Thr Pro Leu Thr 

130 135 140 

Phe Thr Glu Val His Glu Gly Arg Ala Asp He Met He Asp Phe Ala 
145 ISO 155 160 

Arg Tyr Trp Asp Gly Asp Asp Leu Pro Phe Asp Gly Pro Gly Gly He 

165 170 175 

Leu Ala His Ala Phe Phe Pro Lys Thr His Arg Glu Gly Asp Val His 

180 185 190 

Phe Asp Tyr Asp Glu Thr Trp Thr He Gly Asp Asp Gin Gly Thr Asp 

X95 200 205 

Leu Leu Gin Val Ala Ala His Glu Phe Gly His Val Leu Gly Leu Gin 

210 215 220 

His Thr Thr Ala Ala Lys Ala Leu Met Ser Ala Phe Tyr Thr Phe Arg 
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225 

Tyr Pro 

Tyr Gly 

Pro Gin 

Ala Pro 
290 
Arg Gly 
305 

Gly Gly 

Gin Gly 

His lie 

Lys Pro 
370 
Phe Pro 
385 

Tyr Phe 

Arg Val 

Pro Ser 

Phe Leu 
450 
Ala Leu 
465 

Ala Glu 



Leu Ser Leu 
245 

Gin Pro Trp 

260 
Ala Gly He 
275 

Pro Asp Ala 

Glu Leu Phe 

Gin Leu Gin 
325 

Leu Pro Ser 

340 
Trp Phe Phe 
355 

Val Leu Gly 

Val His Ala 

Phe Arg Gly 
405 

Asp Ser Pro 

420 
Glu He Asp 
435 

Arg Gly Arg 

Glu Gly Phe 

Pro Ala Asn 
485 



230 235 240 

Ser Pro Asp Asp Cys Arg Gly Val Gin His Leu 

250 255 
Pro Thr Val Thr Ser Arg Thr Pro Ala Leu Gly 

265 270 
Asp Thr Asn Glu He Ala Pro Leu Glu Pro Asp 

280 285 
Cys Glu Ala Ser Phe Asp Ala Val Ser Thr He 

295 300 
Phe Phe Lys Ala Gly Phe Val Trp Arg Leu Arg 
310 315 320 

Pro Gly Tyr Pro Ala Leu Ala Ser Arg His Trp 

330 335 
Pro Val Asp Ala Ala Phe Glu Asp Ala Gin Gly 

345 350 
Gin Gly Ala Gin Tyr Trp Val Tyr Asp Gly Glu 

360 365 
Pro Ala Pro Leu Thr Glu Leu Gly Leu Val Arg 

375 380 
Ala Leu Val Trp Gly Pro Glu Lys Asn Lys He 
390 395 400 

Arg Asp Tyr Trp Arg Phe His Pro Ser Thr Arg 

410 415 
Val Pro Arg Arg Ala Thr Asp Trp Arg Gly Val 

425 430 
Ala Ala Phe Gin Asp Ala Asp Gly Tyr Ala Tyr 

440 445 
Leu Tyr Trp Lys Phe Asp Pro Val Lys Val Lys 

455 460 
Pro Arg Leu Val Gly Pro Asp. Phe Phe Gly Cys 
470 475 480 

Thr Phe Leu 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 517 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Met Gin Gin Phe Gly Gly Leu Glu Ala Thr Gly lie Asp Glu Ala Thr 

1 5 10 15 

Leu Ala Leu Met Lys Thr Pro Arg Cys Ser Leu Pro Asp Leu Pro Val 

20 25 30 

Leu Thr Gin Ala Arg Arg Arg Arg Gin Ala Pro Ala Pro Thr Lys Trp 

35 40 45 

Asn Lys Arg Asn Leu Ser Trp Arg Val Arg Thr Phe Pro Arg Asp Ser 
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50 55 60 

Pro Leu Gly His Asp Thr Val Arg Ala Leu Met Tyr Tyr Ala Leu Lys 

65 ™ 75 80 

Val Trp Ser Asp He Ala Pro Leu Asn Phe His Glu Val Ala Gly Ser 

85 90 95 

Thr Ala Asp He Gin He Asp Phe Ser Lys Ala Asp His Asn Asp Gly 

100 105 HO 

Tyr Pro Phe Asp Ala Arg Arg His Arg Ala His Ala Phe Phe Pro Gly 

115 120 125 

His His His Thr Ala Gly Tyr Thr His Phe Asn Asp Asp Glu Ala Trp 

130 I 35 140 

Thr Phe Arg ser Ser Asp Ala His Gly Met Asp Leu Phe Ala Val Ala 
145 150 155 160 

Val His Glu Phe Gly His Ala He Gly Leu Ser His Val Ala Ala Ala 

165 170 175 

His Ser He Met Arg Pro Tyr Tyr Gin Gly Pro Val Gly Asp Pro Leu 

180 185 190 

Arg Tyr Gly Leu Pro Tyr Glu Asp Lys Val Arg Val Trp Gin Leu Tyr 

195 200 205 

Gly Val Arg Glu Ser Val Ser Pro Thr Ala Gin Pro Glu Glu Pro Pro 

210 215 220 

Leu Leu Pro Glu Pro Pro Asp Asn Arg Ser Ser Ala Pro Pro Arg Lys 
225 230 235 240 

Asp Val Pro His Arg Cys Ser Thr His Phe Asp Ala Val Ala Gin He 

245 250 255 

Ara Gly Glu Ala Phe Phe Phe Lys Gly Lys Tyr Phe Trp Arg Leu Thr 

9 260 265 270 

Arg Asp Arg His Leu Val Ser Leu Gin Pro Ala Gin Met His Arg Phe 

275 280 285 

Trp Arg Gly Leu Pro Leu His Leu Asp Ser Val Asp Ala Val Tyr Glu 

290 295 300 

Arg Thr Ser Asp His Lys He Val Phe Phe Lys Gly Asp Arg Tyr Trp 
305 * 310 315 320 

Val Phe Lys Asp Asn Asn Val Glu Glu Gly Tyr Pro Arg Pro Val Ser 

325 330 335 

Asp Phe Ser Leu Pro Pro Gly Gly He Asp Ala Ala Phe Ser Trp Ala 

340 345 350 

His Asn Asp Arg Thr Tyr Phe Phe Lys Asp Gin Leu Tyr Trp Arg Tyr 

355 360 365 

Asp Asp His Thr Arg His Met Asp Pro Gly Tyr Pro Ala Gin Ser Pro 

370 375 380 

Leu Trp Arg Gly Val Pro Ser Thr Leu Asp Asp Ala Met Arg Trp Ser 
385 ~ 390 395 400 

Asp Gly Ala Ser Tyr Phe Phe Arg Gly Gin Glu Tyr Trp Lys Val Leu 

405 410 415 

Asp Gly Glu Leu Glu Val Ala Pro Gly Tyr Pro Gin Ser Thr Ala Arg 

420 425 430 

Asp Trp Leu Val Cys Gly Asp Ser Gin Ala Asp Gly Ser Val Ala Ala 

435 440 445 

Gly Val Asp Ala Ala Glu Gly Pro Arg Ala Pro Pro Gly Gin His Asp 

45O 455 460 

Gin Ser Arg Ser Glu Asp Gly Tyr Glu Val Cys Ser Cys Thr Ser Gly 
46 5 " 470 475 480 
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Ala Ser Ser Pro Pro Gly Ala Pro Gly Pro Leu Val Ala Ala Thr Met 

485 490 495 

Leu Leu Leu Leu Pro Pro Leu Ser Pro Gly Ala Leu Trp Thr Ala Ala 

500 505 510 

Gin Ala Leu Thr Leu 
515 



(2) INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 



Arg Xaa Lys Arg 
1 
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What is claimed: 

1 . A purified polynucleotide or fragment thereof derived from human matrix 
metalloproteinase 19 (MMP19) gene wherein said purified polynucleotide is capable of 
selectively hybridizing to the nucleic acid of said MMP19 gene, and wherein said purified 
polynucleotide has at least 50% identity to SEQ ID NO: 1 and fragments, analogs or 
complements thereof. 

2 . The purified polynucleotide of Claim 1 wherein said purified polynucleotide is 
produced by recombinant techniques. 

3 . The purified polynucleotide of Claim 2 wherein said polynucleotide 
produced by recombinant techniques comprises a sequence of at least one epitope encoded 
by MMP19. 

4 . The purified polynucleotide of Claim 3 wherein said epitope is selected 
from the group consisting of SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, and SEQ 
ID NO: 17. 

5 . A purified polynucleotide of MMP1 9 which encodes an MMP1 9 protein 
comprising an amino acid sequence which has at least 40% identity to SEQ ID NO: 10. 

6 . The purified polynucleotide of Claim 5 which encodes an MMP 1 9 protein 
having SEQ ID NO: 10. 

7 . A recombinant expression system comprising a nucleic acid sequence that 
encodes an open reading frame derived from MMP19 which is operably linked to a control 
sequence compatible with a desired host and wherein said nucleic acid sequence is capable 
of selectively hybridizing to the nucleic acid of said MMP19 gene, and wherein said 
purified polynucleotide has at least 50% identity to SEQ ID NO: 1 and fragments, analogs 
or complements thereof. 

8 . A host cell comprising the recombinant expression system of Claim 7. 

9 . An MMP 19 polypeptide wherein said polypeptide has at least 40% identity 
to SEQ ID NO: 10 and fragments thereof. 
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1 0. The polypeptide of Claim 9 which further possesses the ability to cleave 
SEQIDNO: 13. 

1 1 . The polypeptide of Claim 9 selected from the group consisting of SEQ ID 
NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, and SEQ ID NO: 17. 

12. A method of detecting the presence of a target polynucleotide of MMP 1 9 in 
a test sample, comprising: 

(a) contacting said test sample with at least one MMP1 9 specific polynucleotide 
or complement thereof; and 

(b ) detecting the presence of said target polynucleotide of MMP1 9 in the test 

sample; 

wherein said MMP19 specific polynucleotide has at least 50% identity to polynucleotide 
SEQ ID NO: 1 and fragments, analogs or complements thereof. 

13. The method of Claim 14 wherein said target polynucleotide of MMP 19 is 
attached to a solid phase prior to performing step (a). 

14. A method for detecting mRN A of MMP 1 9 in a test sample, comprising: 

(a) contacting said test sample with at least one primer; 

(b) performing reverse transcription in order to produce cDNA; 

(c) amplifying said cDNA obtained from step (b) by using other 
oligonucleotide primer(s) of MMP 19 as sense and antisense primer(s) in a first-stage 
amplification to obtain MMP19 amplicon; 

(d) detecting the presence of said MMP19 amplicon in the test sample; 
wherein said oligonucleotide primers of MMP19 have at least 50% identity to SEQ ED NO: 
1 and fragments, analogs or complements thereof. 

15. A method of detecting target MMP 19 polynucleotide in a test sample 
suspected of containing said target, comprising: 

(a) contacting said target MMP19 polynucleotide with at lease on MMP19 
oligonucleotide as sense primer and with at least one MMP 19 oligonucleotide as an 
antisense primer and amplifying same to obtain a first stage reaction product; 

(b) contacting said first stage reaction product with at least one other MMP 19 
oligonucleotide, with the proviso that the other MMP 19 oligonucleotide is located 3' to the 
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MMP19 oligonucleotides utilized in step (a) and is complementary to said first stage 
reaction product; and 

(c) detecting said target MMP 1 9 polynucleotide, 
wherein said MMP1 9 oligonucleotides utilized in step (a) and step (b) have at least 50% 
identity to SEQ ID NO: 1 and fragments, analogs or complements thereof. 

1 6. The method of Claim 14 or Claim 15 wherein said test sample is reacted 
with a solid phase prior to performing step (a) or step (b) or step (c). 

1 7 . The method of Claim 1 4 or Claim 1 5 wherein said detection step comprises 
utilizing a detectable label capable of generating a measurable signal. 

18. A method for identifying an inhibitor compound of MMP 1 9 comprising the 
steps of: 

(a) providing a reaction mixture comprising 

(i) a substrate; 

(ii) an MMP 19 protein having at least 40% identity to SEQ ID NO: 10 or 
fragments thereof, said MMP19 protein further possessing the ability to cleave said 
substrate; and 

(iii) a compound of interest; 

(b) incubating the reaction mixture; and 

(c) determining the extent of cleavage of the substrate in the reaction mixture. 

19. The method of Claim 1 8 wherein said substrate is SEQ ID NO: 13. 

20. The method of Claim 19 wherein said MMP19 protein is SEQ ID NO: 10. 

21. An antibody which specifically binds to at least one epitope encoded by 
MMP 19, wherein said antibody is polyclonal or monoclonal and wherein said epitope 
comprises an amino acid sequence selected from the group consisting of SEQ ID NO: 14, 
SEQ ID NO: 15, SEQ ID NO: 16, and SEQ ID NO: 17. 

22 . An assay kit for determining the presence of MMP 1 9 antigen or antibody in 
a test sample, comprising a container containing an antibody which specifically bins to 
MMP 19 antigen, wherein said antigen comprises at least one epitope of MMP 19 having at 
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least a sequence selected from the group consisting of SEQ ID NO: 14, SEQ ID NO: 15, 
SEQ ID NO: 16, and SEQ ID NO: 17. 

23. The kit of Claim 28 wherein said antibody is attached to a solid phase. 

24. A test kit useful for detecting MMP19 polynucleotide in a test sample, 
comprising a container containing at least one MMP19 polynucleotide which is capable of 
selectively hybridizing to the nucleic acid of MMP19 gene and wherein said purified 
polynucleotide has at least 50% identity to SEQ ID NO: 1 an fragments, analogs or 
complements thereof. 
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FIG . 1 



xxxxxxxxxxxxxxxxxxxLxxLLLLxxxxxxxLxxxxxxxxxxxxxxxxxxxxxYLxxYYxLx 
xxxxxxxxxxxxxxxxxxxLxxMQxFFGLxVTGKLDxxTLEx^ 

FxLxPxxPKWxxxxxTYRIxJSTYTPDLxxxx^^ 
xADIMIxFAxxEHGDxxPFIX^PGGxLAHAFxPGPGIGGD^ 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx^ 

xy vYvvvvvvvYYxxxxxxxxxxxxxxxxxxxxxxxxG}cNLFLVMHExGHSLGLxHSxDPx^ 
ItfYPxYxxFxDxxxFxLxxDDIxGIQxLYGxxxxxxxxxxxxxx^ 

xxxxxxxxxxxx^xxxxxxxxx^xPxxCDxxxxxxxxxxxxFDAIxxxRGExFFFKDRFFWRxx 
xxxxxxxxxxxLIxxFWPxLPxxIDAAYExxxxxxVFFFKGxxYWxYxGxxxxxGYPxxIxxxL 
GFPxxVxxIDAAxxxxxxxxKTYFFxx^xYWRYDExxxxMDPGYPKxIxxxFxGIxxxVDAVFx 
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FIG. 2 



* * ★ * * 

CCTNCACTNGATTNAGCGATGCCATNAGAGCGTTTNAGTGGGTGTNCCAG 
XTXXSDAXRAFXWVXQ 





* * * * 



CTACCTGTNAGCGGCGTGTTGGACCGCGCCACCCTGCGCCAGATGANTNG 
LPVSGVLDRATLRQMXX 



***** 
TCCCCGGTGCGGGGTTACAGATACCAACAGTTATGNGGCCTGGGNTGAGA 
P R C G V T D TNSYXAWXE 




***** 
GGATCAGTGANTTGTTTGCTAGACAACGGACCAAAATGAGGCGTAAGAAA 
RISXLFARQRTKMR R K K 





* * * 



CGNTTTGCAAAGCAAGGTAACAAATGGTANAAGCAGCACCTNTTCTACCG 
RFAKQGNKWXKQHLFYR 




XVNWAEXVPEPAVGGA 
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FIG. 3 
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FIG. 4(a) 



pro-peptide 

AAAGCTCCCACCTCCACTCGATTCAGCGATGCCATCAGAGCGTTTCAGTG 
TTTCGAGGGTGGAGGTGAGCTAAGTCGCTACGGTAGTCTCGCAAAGTCAC 
KAPTSTRFSDAIRAFQW 



GGTGTCCCAGCTACCTGTCAGCGGCGTGTTGGACCGCGCCAACCTGCGCC 
CCACAGGGTCGATGGACAGTCGCCGCACAACCTGGCGCGGTTGGACGCGG 
VSQLPVSGVLDRANLR 



AGATGACTCGTCCCCGCTGCGGGGTTACAGATACCAACAGTTATGCGGCC 
TCTACTGAGCAGGGGCGACGCCCCAATGTCTATGGTTGTCAATACGCCGG 
q M T R P R C G V T D T N S Y A A 



TGGGCTGAGAGGATCAGTGACTTGTTTGCTAGACACCGGACCAAAATGAG 
ACCCGACTCTCCTAGTCACTGAACAAACGATCTGTGGCCTGGTTTTACTC 
WAERI SDLFARHRTKMR 

catalytic domain 

GCGTAAGAAACGCTTTGCAAAGCAAGGTAACAAATGGTACAAGCAGCACC 
CGCATTCTTTGCGAAACGTTTCGTTCCATTGTTTACCATGTTCGTCGTGG 
R K K R FAKQGNKWYKQH 



TCTCCTACCGCCTGGTGAACTGGCCTGAGCATCTGCCGGAGCCGGCAGTT 
AGAGGATGGCGGACCACTTGACCGGACTCGTAGACGGCCTCGGCCGTCAA 
LSYRLVNWPEHLPEPAV 



CGGGGCGCCGTGCGCGCCGCCTTCCAGTTGTGGAGCAACGTCTCAGCGCT 
GCCCCGCGGCACGCGCGGCGGAAGGTCAACACCTCGTTGCAGAGTCGCGA 
RGAVRAAFQLWSNVSAL 



GGAGTTCTGGGAGGCCCCAGCCACAGGCCCCGCTGACATCCGGCTCACCT 
CCTCAAGACCCTCCGGGGTCGGTGTCCGGGGCGACTGTAGGCCGAGTGGA 
EFWEAPATGPADIRLT 



TCTTCCAAGGGGACCACAACGATGGGCTGGGCAATGCCTTTGATGGCCCA 
AGAAGGTTCCCCTGGTGTTGCTACCCGACCCGTTACGGAAACTACCGGGT 
FFQGDHNDGLGNAFDGP 



GGGGGCGCCCTGGCGCACGCCTTCCTGCCCCGCCGCGGCGAAGCGCACTT 
CCCCCGCGGGACCGCGTGCGGAAGGACGGGGCGGCGCCGCTTCGCGTGAA 
GGALAHAFLPRRGEAHF 



CGACCAAGATGAGCGCTGGTCCCTGAGCCGCCGCCGCGGGCGCAACCTGT 
GCTGGTTCTACTCGCGACCAGGGACTCGGCGGCGGCGCCCGCGTTGGACA 
DQDERWSLSRRRGRNL 
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FIG . 4(b) 

TCGTGGTGCTGGCGCACGAGATCGGTCACACGCTTGGCCTCACCCACTCG 
AGCACCACGACCGCGTGCTCTAGCCAGTGTGCGAACCGGAGTGGGTGAGC 
F V V h A HEIGHTLGLTHS 



CCCGCGCCGCGCGCGCTCATGGCGCCCTACTACAAGAGGCTGGGCCGCGA 
GGGCGCGGCGCGCGCGAGTACCGCGGGATGATGTTCTCCGACCCGGCGCT 
PAPRALMAPYYKRLGRD 



CGCGCTGCTCAGCTGGGACGACGTGCTGGCCGTGCAGAGCCTGTATGGGA 
GCGCGACGAGTCGACCCTGCTGCACGACCGGCACGTCTCGGACATACCCT 
ALLSWDDVLAVQ SLYG 

hinge region 

AGCCCCTAGGGGGCTCAGTGGCCGTCCAGCTCCCAGGAAAGCTGTTCACT 
TCGGGGATCCCCCGAGTCACCGGCAGGTCGAGGGTCCTTTCGACAAGTGA 
KPLGGSVAVQLPGKLFT 



GACTTTGAGACCTGGGACTCCTACAGCCCCCAAGGAAGGCGCCCTGAAAC 
CTGAAACTCTGGACCCTGAGGATGTCGGGGGTTCCTTCCGCGGGACTTTG 
DFETWDSYSPQGRRPET 

hemopexin domain 

GCAGGGCCCTAAATACTGCCACTCTTCCTTCGATGCCATCACTGTAGACA 
CGTCCCGGGATTTATGACGGTGAGAAGGAAGCTACGGTAGTGACATCTGT 
QGPKYCHSSFDAITVD 



GGCAACAGCAACTGTACATTTTTAAAGGGAGCCATTTCTGGGAGGTGGCA 
CCGTTGTCGTTGACATGTAAAAATTTCCCTCGGTAAAGACCCTCCACCGT 
RQQQLYIFKGSHFWEVA 



GCTGATGGCAACGTCTCAGAGCCCCGTCCACTGCAGGAAAGATGGGTCGG 
CGACTACCGTTGCAGAGTCTCGGGGCAGGTGACGTCCTTTCTACCCAGCC 
ADGNVSEPRPLQERWVG 



GCTGCCCCCCAACATTGAGGCTGCGGCAGTGTCATTGAATGATGGAGATT 
CGACGGGGGGTTGTAACTCCGACGCCGTCACAGTAACTTACTACCTCTAA 
LPPNI EAAAVSLNDGD 



TCTACTTCTTCAAAGGGGGTCGATGCTGGAGGTTCCGGGGCCCCAAGCCA 
AGATGAAGAAGTTTCCCCCAGCTACGACCTCCAAGGCCCCGGGGTTCGGT 
FYFFKGGRCWRFRGPKP 



GTGTGGGGTCTCCCACAGCTGTGCCGGGCAGGGGGCCTGCCCCGCCATCC 
CACACCCCAGAGGGTGTCGACACGGCCCGTCCCCCGGACGGGGCGGTAGG 
VWGLPQLCRAGGLPRHP 



TGACGCCGCCCTCTTCTTCCCTCCTCTGCGCCGCCTCATCCTCTTCAAGG 
ACTGCGGCGGGAGAAGAAGGGAGGAGACGCGGCGGAGTAGGAGAAGTTCC 
DAALFFPPLRRLILFK 
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FIG • 4(C) 



GTGCCCGCTACTACGTGCTGGCCCGAGGGGGACTGCAAGTGGAGCCCTAC 
CACGGGCGATGATGCACGACCGGGCTCCCCCTGACGTTCACCTCGGGATG 
GARYYVLARGGLQVEPY 

TACCCCCGAAGTCTGCAGGACTGGGGAGGCATCCCTGAGGAGGTCAGCGG 
ATGGGGGCTTCAGACGTCCTGACCCCTCCGTAGGGACTCCTCCAGTCGCC 
YPRSLQDWGGIPEEVSG 



CGCCCTGCCGAGGCCCGATGGCTCCATCATCTTCTTCCGAGATGACCGCT 
GCGGGACGGCTCCGGGCTACCGAGGTAGTAGAAGAAGGCTCTACTGGCGA 
ALPRPDGSIIFFRDDR 



ACTGGCGCCTCGACCAGGCCAAACTGCAGGCAACCACCTCGGGCCGCTGG 
TGACCGCGGAGCTGGTCCGGTTTGACGTCCGTTGGTGGAGCCCGGCGACC 
YWRLDQAKLQATTSGRW 



GCCACCGAGCTGCCCTGGATGGGCTGCTGGCATGCCAACTCGGGGAGCGC 
CGGTGGCTCGACGGGACCTACCCGACGACCGTACGGTTGAGCCCCTCGCG 
ATELPWMGCWHANSGSA 



CCTGTTCTGAAGGCACCTCCTCACCTCAGAAACTGGTGGTGCTCTCAGGG 
GGACAAGACTTCCGTGGAGGAGTGGAGTCTTTGACCACCACGAGAGTCCC 
L F * 

CAAAATCATGTTCCCCACCCCCGGGGCAGAACCCCTCTTAGAAGCCTCTG 
GTTTTAGTACAAGGGGTGGGGGCCCCGTCTTGGGGAGAATCTTCGGAGAC 

AGTCCCTCTGCAGAAGACCGGGCAGCAAAGCCTCCATCTGGAAGTCTGTC 
TCAGGGAGACGTCTTCTGGCCCGTCGTTTCGGAGGTAGACCTTCAGACAG 

TGCCTTTGTTCCTTGAAGAATGCAGCATTGTCTTTGTCTGTCCCCACCAC 
ACGGAAACAAGGAACTTCTTACGTCGTAACAGAAACAGACAGGGGTGGTG 

ATGGAGGTGGGGGTGGGATCAATCTTAGGAAAAGCAAMMGGGTCCCAG 
TACCTCCACCCCCACCCTAGTTAGAATCCTTTTCGTTTTTTCCCAGGGTC 

ATCCCTTGGCCCTTTCCTCCGAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIG. 5(a) 



1 50 



MMP 19 

Matrilysin MRLTVLC AVCL.LPGSL ALPLPQEAGG 

Gelatinase A MEALMARG ALTGPLRALC LLGCLLSHA . AAAPSPIIKF 

Gelatinase B MSLWQ PL. . .VLVLL VLGCCFAAP. RQRQSTLVLF 

Stromelysin 1 MKSLPILL LLCVAVCSAY PLDGAARGED 

Stromelysin 2 MMHLAFLV LLCLPVCSAY PLSGAAKEED 

Stromelysin 3 MAPAAWLRSA AARALLPPML LLLLQPPPLL ARALPPDVHH 

Collagenase 1 M HSFPPLLLLL FWGV.VSHSF PATLETQEQD 

Collagenase 2 M FSLKTLPFLL LLHVQISKAF P..VSSKEKN 

Collagenase 3 MHPGVLAAFL FLSWTHCRAL PLPSGGDEDD 

MMP12 MKFLLIL LLQATASGAL PLNSSTSLEK 

MMP18 MNCQQLWLG FLLPMTVSGR VLGLAEVAPV 

mtlMMP , . . .MSPAPR PSRC LL LPLLTLGTAL ASLGSAQSSS 

mt2MMP MGSDPSAPGR PGWTGSLLGD REEAARPRLL PLLLVLLGCL GLGVAAEDAE 

mt3MMP MILLTFSTGR RLDFVHHSGV FFLQTLLWIL CATVCGTEQY 

mt4MMP 

Consensus L — LL LLLL-L — AL PL--S — E — 



51 



MMP 1 9 

Matrilysin 
Gelatinase A 
Gelatinase B 
Stromelysin 1 
Stromelysin 2 
Stromelysin 3 
Collegenase 1 
Collegenase 2 
Collegenase 3 
MMP12 
MMP18 
mtlMMP 
mt2MMP 
mt3MMP 
mt4MMP 



MSELQWEQAQ 
PGDVAPK . TD 
PGDLRTNLTD 
TS. . .MNLVQ 
SN. . . KDLAQ 

LH AE 

V DLVQ 

T KTVQ 

LSEEDLQFAE 
NNVL . . . FGE 
DYLSQYGYLQ 

FS PE 

VH AE 

FN VE 



DYLKRFYLYD 
KELAVQYL . . 
RQLAEEYLYR 
KYLENYYDLK 
QYLEKYYNLE 

RR 

KYLEKYYNLK 
DYLEKFYQLP 
RYLRSYYH . P 
RYLEKFYGLE 

KPLE 

AWLQQYGYLP 
NWLRLYGYLP 
VWLQKYGYLP 



KA 

SETKN 

NTFYGCPKES 
YGYTRVAEMR 
KDVKQFVRRK 
KDVKQF . RRK 
GPQPWHAALP 
NDGRQVEKRR 
SNQYQSTRKN 
TNLAGILKEN 
INKLPVTKMK 

GSNN 

PGDLRTHTQR 
QPSRHMSTMR 
PTSPRMSWR 



PTSTRFSDAI 
. . ANSLEAKL 
CNLFVLKDTL 
GESKSLGPAL 
DSGPW.KKI 
DSNLIV.KKI 
.SSPAPAPAT 
NSGPW.EKL 
GTNVIV . EKL 
AASSMT.ERL 
YSGNLMKEKI 
FKPEDITEAL 
. SPQSLSAAI 
. SAQILASAL 
. SAETMQSAL 



100 

RAFQWVSQLP 
KEMQKFFGLP 
KKMQKFFGLP 
LLLQKQLSLP 
REMQKFLGLE 
QGMQKFLGLE 

QEAPR 

KQMQEFFGLK 
KEMQRFFGLN 
REMQSFFGLE 
QEMQHFLGLK 
RAFQEASELP 
AAMQKFYGLQ 
AEMQRFYGIP 
AAMQQFYGIN 
. . MQQFGGLE 



Consensus 



-YLEKYY-LP 



-R -S L-EAL -EMQKFFGLP 
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MMP19 

Matrilysin 
Gelatinase A 
Gelatinase B 
Stromelysin 1 
Stromelysin 2 
Stromelysin 3 
Collegenase 1 
Collegenase 2 
Collegenase 3 
MMP12 
MMP18 
mtlMMP 
mt2MMP 
mt3MMP 
mt4MMP 

Consensus 



FIG . 5(b) 

101 150 
VSGVLDRANL RQMTRPRCGV TDTNSYAAWA ERISDLFARH RTKMRRKKRF 

ITGMLNSRVI EIMQKPRCGV PDVAEYSLFP N 

QTGDLDQNTI ETMRKPRCGN PDVANYNFFP R 

ETGELDSATL KAMRTPRCGV PDLGRFQTFE G 

VTGKLDSDTL EVMRKPRCGV PDVGHFRTFP G 

VTGKLDTDTL EVMRKPRCGV PDVGHFSSFP G 

PA SSLRPPRCGV PDPSDGLSAR N RQKRF 

VTGKPDAETL KVMKQPRCGV PDVAQFVLTE G 

VTGKPNEETL DMMKKPRCGV PDSGGFMLTP G 

VTGKLDDNTL DVMKKPRCGV PDVGEYNVFP R 

VTGQLDTSTL EMMHAPRCGV PDVHHFREMP G 

VSGQLDDATR ARMRQPRCGL ED PFN QKTLKYLLLG R. . . WRKK. . 

VTGKADADTM KAMRRPRCGV PDKFGAEIKA N . .V. .RRKRY 

VTGVLDEETK EWMKRPRCGV PDQFGVRVKA N LRRRRKRY 

MTGKVDRNTI DWMKKPRCGV PDQ. .TRGSS K FHIRRKRY 

ATG.IDEATL ALMKTPRCSL PDLPVLT QARRRRQ 

VTGKLD — TL E-MR KPRCGV PDVG-F--FP G R-KR - 



MMP19 

Matrilysin 
Gelatinase A 
Gelatinase B 
Stromelysin 1 
Stromelysin 2 
Stromelysin 3 
Collegenase 1 
Collegenase 2 
Collegenase 3 
MMP12 
MMP18 
mtlMMP 
mt2MMP 
mt3MMP 
mt4MMP 



151 

AKQGNKWYKQ 
. . . SPKWTSK 
. . . KPKWDKN 
. . .DLKWHHH 
. . . IPKWRKT 
. . . MPKWRKT 
VLSGGRWEKT 
. . . NPRWEQT 
. . . NPKWERT 
. . . TLKWSKM 
. . .GPVWRKH 

AIQGLKWQHN 
ALTGRKWNNH 
ALTGQKWQHK 
APAPTKWNKR 



HLSYRLVNWP 
WTYRIVSYT 
QITYRIIGYT 
NITYWIQNYS 
HLTYRIVNYT 
HLTYRIVNYT 
DLTYRILRFP 
HLTYRIENYT 
NLTYRIRNYT 
NLTYRIVNYT 
YITYRINNYT 
HLTFRILNLP 
EITFCIQNYT 
HLTFSIQNYT 
HITYSIKNVT 
NLSWRVRTFP 



EH . . LPEPAV 
RD. .LPHITV 
PD. . LDPETV 
ED. .LPRAVI 
PD. . LPKDAV 
LPRDAV 
LVQEQV 
LPRADV 
LSEAEV 
PD. .MTHSEV 
PD. .MNREDV 
ST. . LPPHTA 
PK. . VGEYAT 
EK. . LGWYHS 
PK. . VGDPET 
RDSPLGHDTV 



PD. . 
WQ. , 
PD. 
PQ. 



RGAVRAAFQL 
DRLVSKALNM 
DDAFARAFQV 
DDAFARAFAL 
DSAVEKALKV 
DSAIEKALKV 
RQTMAEALKV 
DHAIEKAFQL 
ERAIKDAFEL 
EKAFKKAFKV 
DYAIRKAFQV 
RAALRQAFQD 
YEAIRKAFRV 
MEAVRRAFRV 
RKAIRRAFDV 
RALMYYALKV 



200 

WSNVSALEFW 
WGKEIPLHFR 
WSDVTPLRFS 
WSAVTPLTFT 
WEEVTPLTFS 
WEEVTPLTFS 
WSDVTPLTFT 
WSNVTPLTFT 
WSVASPLIFT 
WSDVTPLNFT 
WSNVTPLKFS 
WSNVAPLTFQ 
WESATPLRFR 
WEQATPLVFQ 
WQNVTPLTFE 
WSDIAPLNFH 



Consensus 
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